Abstract

We consider a multi-class G/G/1 queue with a finite shared buffer. Task admission and server scheduling are
controlled so as to minimize a cost that consists of holding and rejection components. We construct a policy
that is asymptotically optimal in the heavy traffic limit. The policy stems from the solution to the Harrison-Taksar (HT)
free boundary problem and is expressed by a single free boundary point. We show that the HT problem solution,
translated into the queuelength processes, follows a specific triangular form. This form implies a queuelength control
policy which differs from the known $c\mu$ priority rule and has a novel structure.

We exemplify that the probabilistic methods we exploit can be successfully applied to solving scheduling and
admission problems in cloud computing.


Keywords: Multiclass G/G/1 queue; Brownian control problems; Harrison-Taksar free boundary problem; Shared
buffer system; State dependent priorities


1 Introduction 

We consider the problem of finding asymptotically optimal (AO) controls for the multiclass G/G/1 queue with a
single shared buffer, in heavy traffic. The system is characterized by $I$ classes of arriving tasks. Each task class has
its designated queue. The queues share a single buffer, so that the total number of tasks in all queues is limited
by its size. We assume that tasks of all classes occupy equally sized storage slots. Upon arrival of a task of class $i$
(with $i \in \{1, \ldots, I\}$), a decision maker (DM) may either accept or reject it. If the task is admitted, it joins the tail
of one of the $I$ queues. In addition, the DM controls the fraction of effort devoted by the server to the task at the
head of queue $i$, for each $i$. Denote the holding cost per time unit and the rejection cost per customer of class $i$ by $h_i$ and $r_i$,
respectively. Denote the reciprocal mean service time by $\mu_i$. We refer to the two elements of control as admission control
and scheduling control. The problem considered is to minimize the combination of holding and rejection costs. The
setting is motivated by the results demonstrated in [3]. However, there is a crucial difference, which is
expressed in the buffer structure.

We assume a critical load condition and observe the model at the diffusion scale. In the scaling (diffusion) limit,
the queueing control problem (QCP) turns into a Brownian control problem (BCP). It is
shown in [3] that there is an equivalence between an $I$-dimensional BCP and a reduced BCP (RBCP), in which the workload is a
one-dimensional controlled state process. See references therein for additional discussion on BCP reduction.

The specific one-dimensional RBCP is related to a Hamilton-Jacobi-Bellman (HJB) equation which, in our setting,
takes the form of an ordinary differential equation. The solution to this problem was analyzed by Harrison and Taksar
[10], as a singular control problem for a Brownian motion (BM), and is given by a reflected BM (RBM) and by a
free boundary point of the workload, denoted by $x^*$. Thus, we are interested in the solution to the one-dimensional
workload control in an interval of the form $[0, x^*]$.

Hence, we treat the Harrison-Taksar free boundary problem, specializing to the shared buffer. While the admission
control is based on the simple indexing by $r_i\mu_i$ and the free boundary point $x^*$, the scheduling control is more complex.
Namely, the difficulty comes from the shared buffer constraint, and is expressed in understanding the structure of
the holding cost of the workload process. The expression for the workload holding cost, denoted by $\bar h$ (defined in
Section 3), is motivated by the aforementioned equivalence of the BCP and RBCP. In particular, $\bar h(w)$ is the holding
cost minimized over all queuelength configurations with a given workload $w$. In contrast to the case of dedicated buffers,
the solution for the holding cost in the case of a shared buffer is not immediate. Therefore, we formulate and solve this
problem as a linear program (LP). Additionally, the solution is demonstrated numerically in this work.

Using the solution obtained from the LP and the simple $r_i\mu_i$ indexing, we construct a particular AO admission and
scheduling policy, which is specified by the free boundary point that is used in solving the BCP. To briefly summarize
the policy, the domain of the queuelengths in a shared buffer can be geometrically represented by an $I$-simplex in the
positive orthant. For example, in a 2-dimensional system it is a triangle, and in a 3-dimensional
system a tetrahedron. We show that the solution is such that the queuelength process always follows one of the
edges of the simplex. Because of this structure of the queuelength, we term it the triangular policy.

The indexes $h_i$ and $\mu_i$ are used for scheduling. We compare the system with dedicated buffers
for all task classes, which is analyzed in [3] (we call it the rectangular case), with the shared buffer system
analyzed here (the triangular case). We demonstrate that the priority indexing differs from the one used
for the rectangular case and, in particular, from the $c\mu$ priority rule ([6]). In contrast, as we analyze in Section 5,
the resulting solution shows that at most two classes can be concurrently present in the shared buffer.
The key to setting the indexes stems from partitioning the (one-dimensional) workload scale as follows. First, the
buffer is filled with tasks of the class with the lowest $h_i\mu_i$. This is the first index. Next, when the
buffer is full and the workload increases, tasks of the second index class are accumulated, at the expense of tasks of the first
one. As the workload continues to grow, the classes with higher indexes gradually replace those with lower
indexes. Thus, there are $J < I$ workload intervals, such that a unique pair of classes is designated to each interval.
The indexes of the classes which fill the buffer are found by the LP.

Following the LP solution at the limit, we formulate the asymptotically optimal policy such that, when the
workload level is below $x^*$, all arrivals are admitted with high probability. That is, forced rejections, which occur
when the buffer size constraint is reached, have low probability. This is done by assigning dynamic priorities, such
that classes which do not belong to the pair of classes that fill the buffer are served immediately. As a result,
nearly all rejections occur when the workload exceeds $x^*$, and only from one class, which is identified by the lowest
$r_i\mu_i$ product. Under the AO policy, the $I$-dimensional queuelength process converges to the process solving the RBCP.
This convergence is a form of state space collapse (SSC). Namely, the queuelength process limits are dictated by the
workload process limit. Note that the full statement of AO is accomplished by proving that the BCP value function
is a lower bound on the limit inferior of QCP costs under any sequence of policies. This part is sufficiently general
and is not covered in this paper (see [3], Theorem 1, for the details and the proof).

Various models associated with BCP were treated in 0, UD, 0, 0 , [12] and others. Examples of characterization 
of the BCP and RBCP value functions as solutions to HJB equations are found in Scheduling policies, such
as the $c\mu$ rule and its extensions, can also be studied from [Bj and m, and references therein. A general setting of SSC was
considered in [5] and [19].

We finalize the introduction by bringing the practical motivation for the described setting. Our primary interest 
in this problem comes from recent developments in the application area of cloud computing. In particular, the 
constantly growing intensity of incoming, outgoing and traversing traffic in the public cloud (e.g. Amazon) makes the 
diffusion approximation, which we use as our main analytical tool, rather practical. We bring two concrete practical 
examples. First, consider a hybrid cloud, where a private cloud (namely, a local server) has a given capacity and
memory limits. The tasks are served by sharing the computing effort among various task types. Once the incoming 
tasks cannot be queued for the local processing, they are rejected from a private cloud and sent to a public cloud, 
where a fixed charge per usage applies. In this context, [3] only treated the case where locally processed tasks compete 
for the server, having dedicated storage resources. In our case, the problem introduces additional motivation; namely, 
tasks of different types compete with each other over the storage resources as well. In this work, the common storage 
is modeled by a shared buffer. The second example refers to virtual machines (VM) allocation, a procedure which is 
performed by a hypervisor residing on the side of the cloud’s hardware. The VM allocation is based on a common 
computational resource, and their total number is naturally limited by hardware constraints. Since VM for various
types of tasks are instantiated by a single hypervisor, one can view this resource as a buffer. Different types of 
tasks have different communication and computation demands which are expressed in differentiated service effort. 
(Assume the overhead and order of VM release and allocation are embodied in the service rate.) The rejection stands 
for inability to allocate an additional VM, while the rejection price stands for the VM migration cost. For further 
details on modeling of hybrid cloud applications, control and additional practical examples for the model analyzed 
in this work, the reader is referred to [16], [15] and references therein.

Notation 

We will use the following notation. Given $k \in \mathbb{N}$, $\{e^{(i)} : i = 1, \ldots, k\}$ denotes the standard basis in $\mathbb{R}^k$. For $x \in \mathbb{R}$,
$x^+ = \max(x, 0)$. For $a, b \in \mathbb{R}^k$, $a = (a_i)_{i=1}^k$, $b = (b_i)_{i=1}^k$, we denote $\|a\| = \sum_{i=1}^k |a_i|$ and $a \cdot b = \sum_{i=1}^k a_i b_i$. For
$y : \mathbb{R}_+ \to \mathbb{R}^k$ and $T > 0$, $\|y\|_T = \sup_{t \in [0,T]} \|y(t)\|$. The modulus of continuity of $y$ is given by

$$ w_T(y; \theta) = \sup\{\|y(s) - y(t)\| : s, t \in [0,T],\ |s - t| \le \theta\}, \quad \theta, T > 0. $$

Denote $A[s, t] = A(t) - A(s)$ for any process $A$.

The structure of the rest of the paper is as follows. In Sections 2 and 3 we bring the definitions, which, for the most
part, closely follow the definitions from [3]. We start with the queueing and diffusion models. We next define the
Brownian control problem (BCP) and the reduced Brownian control problem (RBCP). Proposition 3.1 shows the
relation between these two problems. These components are consistent with [3], and we bring them here because we
will use them in the proof in Section 5. We analyze the HT solution for the shared buffer and bring a detailed numerical
example in Section 4. Section 5 constitutes the formulation of a nearly optimal scheduling and admission policy. In
Section 6 we state and prove Theorem 6.1 which, together with the lower bound stated in [3], implies the AO of
the proposed policy. We conclude in Section 7.


2 Queueing and diffusion models 

We start with definitions of the queueing and diffusion models. Consider a sequence of systems, indexed by a superscript
$n \in \mathbb{N}$. The system has a single server and a single shared buffer, where each task, of any class, occupies exactly one
slot. The capacity of the buffer is limited. Customers that arrive at the system are either accepted or rejected by the DM.
Those that are accepted are queued in the shared buffer. Within each class, service is
provided in the order of arrival. Processor sharing is allowed, in the sense that the server is capable of serving up to
$I$ customers (of distinct classes) simultaneously, but the service cannot be shared among customers of the same
class. The allocation vector, of size $I$, represents the fractions of effort dedicated to the classes and is chosen
from the set

$$ B = \Big\{\beta \in \mathbb{R}_+^I : \sum_{i \in \mathcal{I}} \beta_i \le 1\Big\}, $$

where $\mathcal{I} = \{1, 2, \ldots, I\}$.

We assume a given probability space $(\Omega, \mathcal{F}, P)$ with expectation w.r.t. $P$ denoted by $E$. Arrivals occur according
to independent renewal processes. The parameters $\lambda_i^n > 0$, $i \in \mathcal{I}$, $n \in \mathbb{N}$, satisfying $\lambda_i^n = n\lambda_i + \sqrt{n}\,\hat\lambda_i + o(\sqrt{n})$, with
fixed positive $\lambda_i$ and $\hat\lambda_i \in \mathbb{R}$, represent the reciprocal mean inter-arrival times of class-$i$ tasks in the $n$-th system.
The parameters $\mu_i^n > 0$, $i \in \mathcal{I}$, $n \in \mathbb{N}$, satisfying $\mu_i^n = n\mu_i + \sqrt{n}\,\hat\mu_i + o(\sqrt{n})$, with fixed positive $\mu_i$ and $\hat\mu_i \in \mathbb{R}$,
represent the reciprocal mean service times of class-$i$ tasks in the $n$-th system. Let $\{IA_i(l) : l \in \mathbb{N}\}_{i \in \mathcal{I}}$ be independent
sequences of strictly positive i.i.d. random variables with mean $E[IA_i(1)] = 1$, $i \in \mathcal{I}$, and squared coefficient of
variation $\mathrm{Var}(IA_i(1))/(E[IA_i(1)])^2 = C^2_{IA_i} \in (0, \infty)$. The number of arrivals of class-$i$ customers up to time $t$, for the
$n$-th system, is given by

$$ A_i^n(t) = A_i(\lambda_i^n t), \quad \text{where } A_i(t) = \sup\Big\{l \ge 0 : \sum_{k=1}^{l} IA_i(k) \le t\Big\}, \quad t \ge 0. \tag{1} $$
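The counting process in (1) can be illustrated with a minimal Python sketch. The function names `renewal_count` and `arrivals_n` are hypothetical and not part of the paper; a finite list of inter-arrival times stands in for the infinite i.i.d. sequence, so the count saturates at the list length.

```python
def renewal_count(interarrivals, t):
    """Return sup{l >= 0 : IA(1) + ... + IA(l) <= t} for a finite list of gaps."""
    total, count = 0.0, 0
    for gap in interarrivals:
        total += gap
        if total > t:
            break  # the next arrival falls strictly after time t
        count += 1
    return count


def arrivals_n(interarrivals, rate_n, t):
    """A^n(t) = A(rate_n * t): the unit-mean renewal process run at speed rate_n."""
    return renewal_count(interarrivals, rate_n * t)
```

The same construction applies verbatim to the potential service process, with service requirements in place of inter-arrival times and the service rate in place of the arrival rate.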

Let independent sequences $\{ST_i(l) : l \in \mathbb{N}\}_{i \in \mathcal{I}}$ of strictly positive i.i.d. random variables be given, with mean
$E[ST_i(1)] = 1$ and squared coefficient of variation $\mathrm{Var}(ST_i(1))/(E[ST_i(1)])^2 = C^2_{ST_i} \in (0, \infty)$. The time required to
complete the $l$-th service of a class-$i$ customer in the $n$-th system is given by $ST_i(l)/\mu_i^n$ units of time dedicated by
the server to this class. The corresponding processes, known as potential service time processes, are

$$ S_i^n(t) = S_i(\mu_i^n t), \quad \text{where } S_i(t) = \sup\Big\{l \ge 0 : \sum_{k=1}^{l} ST_i(k) \le t\Big\}, \quad t \ge 0. \tag{2} $$

$S_i^n(t)$ is the number of class-$i$ jobs completed by the time the server has dedicated $t$ units of time to work on
jobs of this class. We assume the sequences $\{IA_i\}$ and $\{ST_i\}$ are independent. We assume the critical load condition

$$ \sum_{i \in \mathcal{I}} \rho_i = 1, \quad \text{where } \rho_i = \frac{\lambda_i}{\mu_i}, \ i \in \mathcal{I}. \tag{3} $$
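As a quick illustration of condition (3), the following Python sketch computes the traffic intensities and checks that they sum to one. The helper names and the tolerance argument are assumptions made for the example; the last check in the usage note uses the rounded rates of Table 1 from the numerical example, which satisfy (3) only approximately.

```python
def traffic_intensities(lam, mu):
    """Return rho_i = lambda_i / mu_i for each class."""
    return [l / m for l, m in zip(lam, mu)]


def is_critically_loaded(lam, mu, tol=1e-9):
    """Check the critical load condition sum_i rho_i = 1 up to a tolerance."""
    return abs(sum(traffic_intensities(lam, mu)) - 1.0) <= tol
```

For instance, `is_critically_loaded([0.60, 0.73, 0.93], [1.80, 2.20, 2.80], tol=0.01)` holds for the rounded Table 1 values, whose intensities sum to roughly 0.997.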


The number of class-$i$ rejections until time $t$ and the number of class-$i$ customers present at time $t$ in the $n$-th system
are denoted by $Z_i^n(t)$ and $X_i^n(t)$, respectively. Since rejections occur only at times of arrival, we have

$$ Z_i^n(t) = \int_{[0,t]} z^{n,i}(s)\, dA_i^n(s) \tag{4} $$

for some process $z^{n,i}$.

We call $X^n = (X_i^n)_{i \in \mathcal{I}}$ the queuelength process. We assume that $X_i^n(0)$ is deterministic and that no partial service has
been provided to any of the tasks present in the system at time zero. Let $B^n = (B_i^n)_{i \in \mathcal{I}}$ be a process taking values
in the set $B$. Then

$$ T_i^n(t) = \int_0^t B_i^n(s)\, ds \tag{5} $$

gives the time devoted to class-$i$ customers up to time $t$. The number of service completions of class-$i$ jobs during
the time interval $[0, t]$ is given by

$$ D_i^n(t) = S_i^n(T_i^n(t)). \tag{6} $$

We thus have

$$ X_i^n(t) = X_i^n(0) + A_i^n(t) - D_i^n(t) - Z_i^n(t) = X_i^n(0) + A_i^n(t) - S_i^n(T_i^n(t)) - Z_i^n(t), \quad t \ge 0. \tag{7} $$

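It is assumed that $B^n$, $S^n$, $Z^n$, $D^n$, $X^n$, $T^n$ have RCLL sample paths. Next define rescaled versions of the processes
at the diffusion scale as

$$ \hat A_i^n(t) = \frac{A_i^n(t) - \lambda_i^n t}{\sqrt{n}}, \quad \hat S_i^n(t) = \frac{S_i^n(t) - \mu_i^n t}{\sqrt{n}}, \quad i \in \mathcal{I}, \tag{8} $$

$$ \hat Z^n(t) = \frac{Z^n(t)}{\sqrt{n}}, \quad \hat X^n(t) = \frac{X^n(t)}{\sqrt{n}}. $$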

The shared buffer structure is specified as follows:

$$ \mathcal{X} = \Big\{y \in \mathbb{R}_+^I : \sum_{i \in \mathcal{I}} y_i \le b\Big\}, $$

for some fixed $b > 0$. We assume that the rescaled initial condition $\hat X^n(0)$ also lies in $\mathcal{X}$. The buffer constraint is always met, namely:

$$ \hat X^n(t) \in \mathcal{X}, \quad t \ge 0, \text{ a.s.} \tag{9} $$

The rejection mechanism assures the condition above by rejecting class-$i$ arrivals occurring at a time $t$ when
$(X^n(t-) + e^{(i)})/\sqrt{n} \notin \mathcal{X}$. We refer to these rejections as forced rejections. In our setting, we distinguish them from
admission/rejection decisions which are part of the control process. Note that the actual un-normalized buffer size scales
like $\sqrt{n}$.

The control process $U^n = (Z^n, B^n)$, which is determined based on observations from the past (and present) events
in the system, is defined as follows.


Definition 2.1: (Admissible control, QCP) Fix $n \in \mathbb{N}$ and consider fixed processes $(A^n, S^n)$ given by (1) and (2).
A process $U^n = (Z^n, B^n)$, taking values in $\mathbb{R}_+^I \times B$, having RCLL sample paths, with the processes $Z_i^n$, $i \in \mathcal{I}$, having
nondecreasing sample paths and given in the form (4), is said to be an admissible control for the $n$-th system if the
following holds. Let the processes $T^n$, $D^n$, $X^n$ be defined by the processes $(A^n, S^n)$ and the control
$(Z^n, B^n)$ via equations (5), (6) and (7), respectively. Then

• $(Z^n, B^n)$ is adapted to the filtration $\sigma\{A_i^n(s), D_i^n(s),\ i \in \mathcal{I},\ s \le t\}$;

• One has a.s. that, for all $i \in \mathcal{I}$ and $t \ge 0$,

$$ X_i^n(t) = 0 \ \text{implies} \ B_i^n(t) = 0. \tag{10} $$

An admissible control under which the scaled version $\hat X^n$ of $X^n$ satisfies (9) is said to satisfy the buffer constraint.

We denote the class of all admissible controls $U^n = (Z^n, B^n)$ satisfying the buffer constraint by $\mathcal{U}^n$. Fix $\alpha > 0$,
$h \in (0, \infty)^I$ and $r \in (0, \infty)^I$. For each $n \in \mathbb{N}$ consider the cost

$$ J^n(U^n) = E\Big[\int_0^\infty e^{-\alpha t}\big[h \cdot \hat X^n(t)\,dt + r \cdot d\hat Z^n(t)\big]\Big]. \tag{11} $$

The QCP value is given by

$$ V^n = \inf_{U^n \in \mathcal{U}^n} J^n(U^n). \tag{12} $$

Denote $\theta^n = (\theta_i^n)_{i \in \mathcal{I}}$, $\theta_i^n = 1/\mu_i^n$, and $\theta = (\theta_i)_{i \in \mathcal{I}}$, $\theta_i = 1/\mu_i$. The process $\theta^n \cdot X^n$, referred to as the workload, its
normalized version $\theta^n \cdot \hat X^n$ and its formal limit, $\theta \cdot X$, will take part in the state-space collapse.


3 The Brownian control problems 


We now address the limit problems. First, we show that the scaled processes defined in the previous section give
rise to the limit processes of the queuelengths. Hence, the definition of the $I$-dimensional BCP of the queuelengths
follows. Next, we define the one-dimensional BCP of the workload. Finally, we show the equivalence of the value
functions of these two problems.

Using (5), (7) and the definition of the rescaled processes, the following identity holds for $i \in \mathcal{I}$ and $t \ge 0$:

$$ \hat X_i^n(t) = \hat X_i^n(0) + W_i^n(t) + Y_i^n(t) - \hat Z_i^n(t), \tag{13} $$

where, denoting $m_i = \hat\lambda_i - \rho_i\hat\mu_i$ and $m_i^n = \frac{\lambda_i^n - \rho_i\mu_i^n}{\sqrt{n}} = m_i + o(1)$,

$$ W_i^n(t) = \hat A_i^n(t) - \hat S_i^n(T_i^n(t)) + m_i^n t, \tag{14} $$

and

$$ Y_i^n(t) = \frac{\mu_i^n}{\sqrt{n}}\big(\rho_i t - T_i^n(t)\big). \tag{15} $$

Since $\sum_{i \in \mathcal{I}} \rho_i = 1$ and one always has $\sum_{i \in \mathcal{I}} B_i^n(t) \le 1$, it follows that

$$ \theta^n \cdot Y^n \ \text{is a nonnegative, nondecreasing process.} \tag{16} $$
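As a sanity check, the decomposition (13)-(15) can be verified directly from (7) and the diffusion scaling. The following sketch uses only the definitions above, with $\hat X^n = X^n/\sqrt{n}$, $\hat Z^n = Z^n/\sqrt{n}$, the centered processes $\hat A_i^n(t) = (A_i^n(t) - \lambda_i^n t)/\sqrt{n}$ and $\hat S_i^n(t) = (S_i^n(t) - \mu_i^n t)/\sqrt{n}$, and the centering constant written out as $m_i^n = (\lambda_i^n - \rho_i\mu_i^n)/\sqrt{n}$:

$$
\begin{aligned}
\hat X_i^n(t) &= \hat X_i^n(0) + \frac{A_i^n(t) - \lambda_i^n t}{\sqrt{n}} - \frac{S_i^n(T_i^n(t)) - \mu_i^n T_i^n(t)}{\sqrt{n}} + \frac{\lambda_i^n t - \mu_i^n T_i^n(t)}{\sqrt{n}} - \hat Z_i^n(t) \\
&= \hat X_i^n(0) + \hat A_i^n(t) - \hat S_i^n(T_i^n(t)) + \frac{\lambda_i^n - \rho_i\mu_i^n}{\sqrt{n}}\, t + \frac{\mu_i^n}{\sqrt{n}}\big(\rho_i t - T_i^n(t)\big) - \hat Z_i^n(t).
\end{aligned}
$$

The last line is $\hat X_i^n(0) + W_i^n(t) + Y_i^n(t) - \hat Z_i^n(t)$. Moreover, since $\lambda_i = \rho_i\mu_i$, the expansions of $\lambda_i^n$ and $\mu_i^n$ give $m_i^n = \hat\lambda_i - \rho_i\hat\mu_i + o(1) = m_i + o(1)$.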


We are interested in the limit problem, where the scaling parameter $n$ is taken to infinity. As far as the real system
is concerned, we view a configuration where the task arrival rate and, consequently, the service rate grow large. This
connection makes the following model of particular practical interest. Formally, the limits of (13)-(16)
give rise to a control problem associated with diffusion. Consider equation (13). We assume that the
scaled initial conditions $\hat X^n(0)$ converge to $x$ as $n \to \infty$. Next, the centered, rescaled renewal process $\hat A_i^n$ [resp., $\hat S_i^n$]
converges weakly to a BM starting from zero, with zero mean and diffusion coefficient $\lambda_i^{1/2} C_{IA_i}$ [resp., $\mu_i^{1/2} C_{ST_i}$] (see
Section 17 of [4]). Assume the processes involved in (13) give rise to a limiting BCP. Then it follows that $Y^n$ in (15) are
of order one as $n \to \infty$ and, consequently, $T_i^n(t)$ converges to $\rho_i t$. Thus, applying the time change in the second term
on the r.h.s. of (14), one sees that $W^n$ converges to an $(m, \sigma)$-BM starting from zero, with drift vector $m = (m_i)_{i \in \mathcal{I}}$ and
diffusion matrix $\sigma = \mathrm{diag}(\sigma_i)$, where

$$ \sigma_i^2 := \lambda_i C_{IA_i}^2 + \mu_i C_{ST_i}^2 \rho_i = \lambda_i\big(C_{IA_i}^2 + C_{ST_i}^2\big). $$

As for $Y^n$, it gives rise to a process $Y$ such that $\theta \cdot Y$ is nonnegative and nondecreasing. The process $\hat Z^n$ gives rise
to a process $Z$ with nonnegative, nondecreasing components.
The BCP


Definition 3.2: (Admissible control, BCP) An admissible control for the initial condition $x_0 \in \mathcal{X}$ is a filtered
probability space $(\Omega', \mathcal{F}', \{\mathcal{F}'_t\}, P')$ for which there exist an $(m, \sigma)$-BM, $W$, and a positive process $U = (Y, Z)$, with
RCLL sample paths, such that the following conditions hold:

• $W$, $Y$ and $Z$ are adapted to $\{\mathcal{F}'_t\}$;

• For $0 \le s < t$, the increment $W_t - W_s$ is independent of $\mathcal{F}'_s$ under $P'$; (17)

• $\theta \cdot Y$ and $Z_i$, $i = 1, \ldots, I$, are nondecreasing; (18)

• With

$$ X(t) = x_0 + W(t) + Y(t) - Z(t), \quad t \ge 0, \tag{19} $$

one has

$$ X(t) \in \mathcal{X} \ \text{for all } t, \ P'\text{-a.s.} \tag{20} $$

In what follows we will always assume that $(Y, Z)$ are admissible controls. Denote the class of such controls by $\mathcal{A}(x_0)$,
where $x_0$ stands for the initial condition. Let

$$ J(x_0, Y, Z) = E\Big[\int_0^\infty e^{-\alpha t}\big[h \cdot X(t)\,dt + r \cdot dZ(t)\big]\Big]. \tag{21} $$

The BCP is to find $(Y, Z)$ that minimize $J(x_0, Y, Z)$ and achieve the value

$$ V(x_0) = \inf_{(Y,Z) \in \mathcal{A}(x_0)} J(x_0, Y, Z). \tag{22} $$


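The RBCP


The reduced one-dimensional problem is obtained as follows. Multiply equation (19) and the processes involved in it
by $\theta$. Denote $\bar x_0 = \theta \cdot x_0$, $\bar m = \theta \cdot m$ and $\bar\sigma^2 = \sum_{i \in \mathcal{I}} \theta_i^2\sigma_i^2$. Let

$$ \mathbf{x} = \max\{\theta \cdot \xi : \xi \in \mathcal{X}\}. \tag{23} $$


Definition 3.3: (Admissible control, RBCP) An admissible control for the initial condition $\bar x_0 \in [0, \mathbf{x}]$ is a filtered
probability space $(\Omega', \mathcal{F}', \{\mathcal{F}'_t\}, P')$ for which there exist an $(\bar m, \bar\sigma)$-BM, $\bar W$, and a positive process $\bar U = (\bar Y, \bar Z)$ with
RCLL sample paths, such that the following conditions hold:

• $\bar W$, $\bar Y$ and $\bar Z$ are adapted to $\{\mathcal{F}'_t\}$;

• For $0 \le s < t$, the increment $\bar W_t - \bar W_s$ is independent of $\mathcal{F}'_s$ under $P'$;

• $\bar Y$ and $\bar Z$ are nondecreasing; (24)

• With

$$ \bar X(t) = \bar x_0 + \bar W(t) + \bar Y(t) - \bar Z(t), \quad t \ge 0, \tag{25} $$

one has

$$ \bar X(t) \in [0, \mathbf{x}] \ \text{for all } t, \ P'\text{-a.s.} \tag{26} $$

We write $\bar{\mathcal{A}}(\bar x_0)$ for the class of admissible controls for the initial condition $\bar x_0$. Given $(\bar Y, \bar Z) \in \bar{\mathcal{A}}(\bar x_0)$, let

$$ \bar J(\bar x_0, \bar Y, \bar Z) = E\Big[\int_0^\infty e^{-\alpha t}\big[\bar h(\bar X_t)\,dt + \bar r\, d\bar Z(t)\big]\Big], \tag{27} $$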
where the holding cost and the rejection cost for the one-dimensional problem are defined as follows:

$$ \bar h(w) = \min\{h \cdot \xi : \xi \in \mathcal{X},\ \theta \cdot \xi = w\}, \quad w \in [0, \mathbf{x}], $$
$$ \bar r = \min\{r \cdot z : z \in \mathbb{R}_+^I,\ \theta \cdot z = 1\}. $$

One interprets $\bar r$ as a minimal rejection penalty per unit of work. Note that $\bar h$ is convex by convexity of the set $\mathcal{X}$.
Note also that, as members of $(0, \infty)^I$, $\theta$ and $h$ cannot be orthogonal, thus $\bar h(w) > 0$ for any $w > 0$. Since $\bar h(0) = 0$,
it follows that $\bar h$ is strictly increasing. Let

$$ \bar V(\bar x_0) = \inf_{(\bar Y, \bar Z) \in \bar{\mathcal{A}}(\bar x_0)} \bar J(\bar x_0, \bar Y, \bar Z). $$


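The following definitions relate the two problems. Note that the extremal points of the set $\{z \in \mathbb{R}_+^I : \theta \cdot z = 1\}$
are precisely $\theta_i^{-1} e^{(i)}$, $i \in \mathcal{I}$. Hence there exists (at least one) $i^*$ such that $\zeta = \theta_{i^*}^{-1} e^{(i^*)}$ satisfies

$$ \zeta \in \operatorname{argmin}\{r \cdot z : z \in \mathbb{R}_+^I,\ \theta \cdot z = 1\}. $$

Fix such $i^*$ and the corresponding $\zeta$. Note that $i^*$ can be expressed via

$$ r_{i^*}\mu_{i^*} = \min_{i \in \mathcal{I}} r_i\mu_i. \tag{28} $$

Next, let $\gamma : [0, \mathbf{x}] \to \mathcal{X}$ be Borel measurable, satisfying

$$ \gamma(w) \in \operatorname{argmin}\{h \cdot \xi : \xi \in \mathcal{X},\ \theta \cdot \xi = w\}, \quad w \in [0, \mathbf{x}]. \tag{29} $$

By definition, $\gamma(w) \in \mathcal{X}$, $\theta \cdot \gamma(w) = w$, and $h \cdot \gamma(w) = \bar h(w) \le h \cdot \xi$ for every $\xi \in \mathcal{X}$ for which $\theta \cdot \xi = w$. Hence, as
will become clear soon, these definitions of the costs imply the equivalence of the value functions of the BCP and
RBCP.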
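Because the minimum defining the one-dimensional rejection cost is attained at an extreme point, identifying the rejection class reduces to a scan over the products $r_i\mu_i$ as in (28). A minimal Python sketch follows; the function name `rejection_class` is hypothetical, and the usage note relies on the rejection costs and service rates listed in Table 1 of the numerical example.

```python
def rejection_class(r, mu):
    """Return (i_star, r_bar): the index minimizing r_i * mu_i and the minimal product."""
    products = [ri * mi for ri, mi in zip(r, mu)]
    # Ties are broken in favor of the smallest index.
    i_star = min(range(len(products)), key=products.__getitem__)
    return i_star, products[i_star]
```

With `r = [962.5, 700.0, 875.0]` and `mu = [1.80, 2.20, 2.80]`, the products are approximately 1732.5, 1540 and 2450, so the sketch selects the second class (index 1) as the one from which rejections are taken.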


Remark 3.1: One observes that the rejection process in the problem with dedicated buffers (the rectangular
case) and in that with a shared buffer (which we term the triangular case) follows the same rule. However, as opposed
to [3], the solution for $\bar h(w)$ in the case of a shared buffer is not immediate. We analyze this in Section 4.1.


We now state the key proposition which determines the relation between the BCP and RBCP. This proposition,
together with Proposition 4.2 stated in the sequel, leads to the formulation of the AO control policy.

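Proposition 3.1: Let $x_0 \in \mathcal{X}$ and $\bar x_0 = \theta \cdot x_0$.

i. Given an admissible control $(\Omega', \mathcal{F}', \{\mathcal{F}'_t\}, P', W, Y, Z)$ for $x_0$ for the (multidimensional) BCP, define $(\bar W, \bar X, \bar Y, \bar Z)$
by $(\theta \cdot W, \theta \cdot X, \theta \cdot Y, \theta \cdot Z)$. Then $(\bar Y, \bar Z) \in \bar{\mathcal{A}}(\bar x_0)$ and $\bar J(\bar x_0, \bar Y, \bar Z) \le J(x_0, Y, Z)$.

ii. Conversely, let an admissible control $(\Omega', \mathcal{F}', \{\mathcal{F}'_t\}, P', \bar W, \bar Y, \bar Z)$ for $\bar x_0$ for the RBCP be given, and assume the
probability space supports an $(m, \sigma)$-BM $W$. Assume $W$ is $\{\mathcal{F}'_t\}$-adapted and satisfies $\theta \cdot W = \bar W$ and (17). Construct
$(X, Y, Z)$ by

$$ X(t) = \gamma(\bar X(t)), \quad Z(t) = \zeta \bar Z(t), \tag{30} $$

$$ Y(t) = X(t) - x_0 - W(t) + Z(t). \tag{31} $$

Then $(Y, Z) \in \mathcal{A}(x_0)$, and $J(x_0, Y, Z) \le \bar J(\bar x_0, \bar Y, \bar Z)$.

iii. Consequently, $V(x_0) = \bar V(\bar x_0)$.

Note that the proposition holds for any convex $\bar h$. Hence, we skip the proof, which the interested reader can find
in [3].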
4 The Harrison-Taksar free boundary problem

We now focus on the one-dimensional problem. The function $\bar V$ is $C^2[0, \mathbf{x}]$ and solves the following Bellman equation:

$$ \begin{cases} \big[\tfrac{1}{2}\bar\sigma^2 f'' + \bar m f' - \alpha f + \bar h\big] \wedge f' \wedge [\bar r - f'] = 0 & \text{in } (0, \mathbf{x}), \\ f'(0) = 0, \quad f'(\mathbf{x}) = \bar r. \end{cases} \tag{32} $$

This result has been demonstrated by Harrison and Taksar [10]. They showed that an optimal control is such that
the process $\bar X$ is an RBM. We next introduce the Skorohod map. Let $a < b$. The Skorohod map on the interval $[a, b]$,
denoted by $\Gamma_{[a,b]}$, is a map $D([0, \infty) : \mathbb{R}) \to D([0, \infty) : \mathbb{R})^3$ that solves the Skorohod problem: to a given $\psi$ it
assigns a triplet $(\varphi, \eta_1, \eta_2)$ such that

$$ \varphi = \psi + \eta_1 - \eta_2, \quad \varphi(t) \in [a, b] \ \text{for all } t, \tag{33} $$

$$ \eta_i \ \text{are nonnegative and nondecreasing, and} \ \int_{[0,\infty)} 1_{(a,b]}(\varphi)\, d\eta_1 = \int_{[0,\infty)} 1_{[a,b)}(\varphi)\, d\eta_2 = 0. \tag{34} $$

See [14] for existence and uniqueness of solutions, and continuity and further properties of the map. In particular,
it is well-known that $\Gamma_{[a,b]}$ is continuous in the uniformly-on-compacts topology.

The following proposition is mostly a result of [12].

Proposition 4.2: The function $\bar V$ is $C^2$ on $[0, \mathbf{x}]$ and solves (32) uniquely among all $C^2$ functions. Denote
$x^* = \inf\{y \in [0, \mathbf{x}] : \bar V'(z) = \bar r \ \text{for } z \in [y, \mathbf{x}]\}$. Then $x^* \in (0, \mathbf{x})$. Fix $\bar x_0 \in [0, \mathbf{x}]$. Let $\bar W$ be an $(\bar m, \bar\sigma)$-BM and let $\bar X$, $\bar Y$ and $\bar Z$ be
the corresponding RBM on $[0, x^*]$ and boundary terms for $0$ and $x^*$, defined as

$$ (\bar X, \bar Y, \bar Z) = \Gamma_{[0,x^*]}(\bar x_0 + \bar W). \tag{35} $$

Then $(\bar Y, \bar Z)$ is optimal for $\bar V(\bar x_0)$, i.e., $\bar J(\bar x_0, \bar Y, \bar Z) = \bar V(\bar x_0)$.

The reader is referred to [3] for the proof. An optimal control for the BCP stems from Propositions 3.1 and 4.2.
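The two-sided reflection used to build the optimal RBM can be sketched in discrete time. The following Python function is an illustration only: it applies the Skorohod map on an interval $[a, b]$ to a path sampled at finitely many points (treated as piecewise constant), and the name `skorokhod_map` is an assumption of the example. In the setting of the proposition above, the input would be a sampled path of the initial workload plus the Brownian motion, with $a = 0$ and $b = x^*$.

```python
def skorokhod_map(psi, a, b):
    """Two-sided reflection of a sampled path psi on [a, b].

    Returns (phi, eta1, eta2) with phi = psi + eta1 - eta2, phi in [a, b],
    eta1 increasing only when phi = a and eta2 only when phi = b.
    """
    if not a < b:
        raise ValueError("need a < b")
    phi, eta1, eta2 = [], [], []
    push_up = push_down = 0.0
    for k, value in enumerate(psi):
        # Unconstrained position: previous constrained value plus the free increment.
        free = value if k == 0 else phi[-1] + (value - psi[k - 1])
        if free < a:
            push_up += a - free      # minimal push at the lower boundary
            free = a
        elif free > b:
            push_down += free - b    # minimal push at the upper boundary
            free = b
        phi.append(free)
        eta1.append(push_up)
        eta2.append(push_down)
    return phi, eta1, eta2
```

The second and third outputs play the roles of the lower and upper boundary terms; in the workload interpretation, the upper one corresponds to rejected work.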


4.1 Solution for h 

As mentioned, the shared buffer leads to a non-trivial solution for the one-dimensional
holding cost. The structure of the workload process is such that $\bar X = \theta \cdot X$ is given as an RBM on $[0, x^*]$,
where the free boundary point $x^*$ is dictated by the Bellman equation. The multidimensional queuelength process
$X$ is recovered from $\bar X$ by $X = \gamma(\bar X)$.

Recall that the shared buffer domain has the form

$$ \mathcal{X} = \Big\{x \in \mathbb{R}_+^I : 0 \le \sum_{i=1}^{I} x_i \le b\Big\}, \tag{36} $$

for some fixed $b > 0$. As mentioned, there is no difference between the shared and the dedicated models
in the rejection structure. Namely, the BCP solution implies that in the scaled model, rejections should occur only when
the scaled workload exceeds the level $x^*$, and only from class $i^*$, for which $r_i\mu_i$ is minimal. Next, the relation

$$ \hat X^n = \gamma(\theta \cdot \hat X^n) + o(1) \tag{37} $$

between the queuelength and workload processes must hold. We solve for the minimizing curve $\gamma$, where $\mathcal{X}$ takes the
form (36). Equation (29) can be written as

$$ \gamma(w) \in \operatorname{argmin}\Big\{h \cdot x : 0 \le x_i,\ 0 \le \sum_{i=1}^{I} x_i \le b \ \text{and} \ \theta \cdot x = w\Big\}, \quad w \in [0, \mathbf{x}]. $$
To compare the rectangular and triangular systems, one sees that the description of $\bar h$ in the latter case is
not trivial. This is because the minimization is done under the restriction of the shared buffer. Namely, in the rectangular
case, it is clear that as the workload grows, the buffers with lower priorities are filled up until their limits are
reached. However, in the triangular case, one has to explicitly solve for $\bar h$ in order to understand which task types are
preferred at each workload point.

Hence, we find $\bar h$ by a linear program (LP). Assume first that the buffer is full and the total workload is $w$. We later
extend the solution to the workload points where the buffer is partially empty. In standard form, the LP reads

$$ \text{minimize} \ \sum_{i=1}^{I} h_i x_i \quad \text{w.r.t.} \quad \sum_{i=1}^{I} \theta_i x_i = w, \quad \sum_{i=1}^{I} x_i = b, \quad x_i \ge 0,\ i \in \mathcal{I}. \tag{38} $$

Without loss of generality, consider two classes, denoted by class 1 and class 2, such that $\theta_2 > \theta_1$, and $\theta_1 b < w$,
$\theta_2 b > w$. These conditions are general, and are needed for the convenience of the canonical representation of the
LP. Rewrite (38) in canonical representation:


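$$
\begin{aligned}
\text{minimize} \quad & \frac{h_1(w - \theta_2 b)}{\theta_1 - \theta_2} + \frac{h_2(\theta_1 b - w)}{\theta_1 - \theta_2}
+ \Big(h_3 - h_1\frac{\theta_3 - \theta_2}{\theta_1 - \theta_2} - h_2\frac{\theta_1 - \theta_3}{\theta_1 - \theta_2}\Big)x_3 + \cdots
+ \Big(h_I - h_1\frac{\theta_I - \theta_2}{\theta_1 - \theta_2} - h_2\frac{\theta_1 - \theta_I}{\theta_1 - \theta_2}\Big)x_I \\
\text{w.r.t.} \quad &
\begin{pmatrix}
x_1 + \dfrac{\theta_3 - \theta_2}{\theta_1 - \theta_2}x_3 + \dfrac{\theta_4 - \theta_2}{\theta_1 - \theta_2}x_4 + \cdots + \dfrac{\theta_I - \theta_2}{\theta_1 - \theta_2}x_I \\[2ex]
x_2 + \dfrac{\theta_1 - \theta_3}{\theta_1 - \theta_2}x_3 + \dfrac{\theta_1 - \theta_4}{\theta_1 - \theta_2}x_4 + \cdots + \dfrac{\theta_1 - \theta_I}{\theta_1 - \theta_2}x_I
\end{pmatrix}
=
\begin{pmatrix}
\dfrac{b\theta_2 - w}{\theta_2 - \theta_1} \\[2ex]
\dfrac{w - b\theta_1}{\theta_2 - \theta_1}
\end{pmatrix}
\end{aligned}
\tag{39}
$$

In the case all the coefficients of the new objective function in the above display are positive, by [Theorem 3.4.1, [17]]
the minimum for $\bar h$ is achieved at

$$ x = \Big\{\frac{b\theta_2 - w}{\theta_2 - \theta_1},\ \frac{w - b\theta_1}{\theta_2 - \theta_1},\ 0, \ldots, 0\Big\}, $$

and is expressed by

$$ \bar h = \frac{h_1(w - \theta_2 b)}{\theta_1 - \theta_2} + \frac{h_2(\theta_1 b - w)}{\theta_1 - \theta_2}. $$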

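Alternatively, assume $\Big(h_3 - h_1\frac{\theta_3 - \theta_2}{\theta_1 - \theta_2} - h_2\frac{\theta_1 - \theta_3}{\theta_1 - \theta_2}\Big)$ is negative. Then, we act according to the simplex algorithm
presented in [17] and perform a pivot operation, in which $x_3$ replaces $x_2$ as a basic variable. This gives the following problem:

$$
\begin{aligned}
\text{minimize} \quad & \frac{h_1(w - \theta_3 b)}{\theta_1 - \theta_3} + \frac{h_3(\theta_1 b - w)}{\theta_1 - \theta_3}
+ \Big(h_2 - h_1\frac{\theta_2 - \theta_3}{\theta_1 - \theta_3} - h_3\frac{\theta_1 - \theta_2}{\theta_1 - \theta_3}\Big)x_2 + \cdots
+ \Big(h_I - h_1\frac{\theta_I - \theta_3}{\theta_1 - \theta_3} - h_3\frac{\theta_1 - \theta_I}{\theta_1 - \theta_3}\Big)x_I \\
\text{w.r.t.} \quad &
\begin{pmatrix}
x_1 + \dfrac{\theta_2 - \theta_3}{\theta_1 - \theta_3}x_2 + \dfrac{\theta_4 - \theta_3}{\theta_1 - \theta_3}x_4 + \cdots + \dfrac{\theta_I - \theta_3}{\theta_1 - \theta_3}x_I \\[2ex]
\dfrac{\theta_1 - \theta_2}{\theta_1 - \theta_3}x_2 + x_3 + \dfrac{\theta_1 - \theta_4}{\theta_1 - \theta_3}x_4 + \cdots + \dfrac{\theta_1 - \theta_I}{\theta_1 - \theta_3}x_I
\end{pmatrix}
=
\begin{pmatrix}
\dfrac{b\theta_3 - w}{\theta_3 - \theta_1} \\[2ex]
\dfrac{w - b\theta_1}{\theta_3 - \theta_1}
\end{pmatrix}
\end{aligned}
\tag{40}
$$

One sees that the solution to this problem has a similar form. Hence, in the case all coefficients in the function to be
minimized are positive, the minimum for $\bar h$ is achieved this time at

$$ x = \Big\{\frac{b\theta_3 - w}{\theta_3 - \theta_1},\ 0,\ \frac{w - b\theta_1}{\theta_3 - \theta_1},\ 0, \ldots, 0\Big\}, $$

and is expressed by

$$ \bar h = \frac{h_1(w - \theta_3 b)}{\theta_1 - \theta_3} + \frac{h_3(\theta_1 b - w)}{\theta_1 - \theta_3}. $$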
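The pivoting argument can be cross-checked numerically without a simplex implementation. The sketch below is a brute-force alternative, not the procedure of [17]: since the problem has one equality constraint and one buffer constraint, every basic feasible solution has either a single class present (buffer not necessarily full) or two classes present with the buffer full, so it is enough to enumerate these candidates. The function name `workload_holding_cost` is hypothetical; the usage note takes the holding costs and service rates from Table 1 of the numerical example, with a buffer of 125 tasks.

```python
def workload_holding_cost(w, h, theta, b):
    """Minimize h.x subject to theta.x = w, sum(x) <= b, x >= 0.

    Enumerates basic solutions (one class, or two classes with a full buffer).
    Returns (cost, x), or None if the workload level w is infeasible.
    """
    n = len(h)
    eps = 1e-12
    best = None

    def consider(x):
        nonlocal best
        cost = sum(hi * xi for hi, xi in zip(h, x))
        if best is None or cost < best[0]:
            best = (cost, x)

    # Candidates with a single class present; the buffer may be partially empty.
    for i in range(n):
        xi = w / theta[i]
        if xi <= b + eps:
            x = [0.0] * n
            x[i] = xi
            consider(x)

    # Candidates with two classes present and a full buffer.
    for i in range(n):
        for k in range(i + 1, n):
            if theta[i] == theta[k]:
                continue
            xi = (b * theta[k] - w) / (theta[k] - theta[i])
            xk = (w - b * theta[i]) / (theta[k] - theta[i])
            if xi >= -eps and xk >= -eps:
                x = [0.0] * n
                x[i], x[k] = max(xi, 0.0), max(xk, 0.0)
                consider(x)
    return best
```

With `h = [1390, 1050, 733]`, `theta = [1/1.8, 1/2.2, 1/2.8]` and `b = 125`, a workload of 50 is served most cheaply by 55 tasks of the second class and 70 tasks of the third, at a cost of about 109060, while a workload of 40 is carried by 112 tasks of the third class alone.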
See from [17] (Section 3.8) that there is a finite number of pivot steps which brings the canonical representation
above to a form whose objective function has non-negative coefficients. (There are additional possibilities, namely an
unbounded objective function and/or degenerate cases, which we rule out since they imply non-physical properties
of the problem setting.) Therefore, for any $w$, the minimal $\bar h(w)$ is achieved when only tasks belonging to at most
two classes are present in the buffer. Note that the solution above is not necessarily unique, as more than one pair
of $\theta_i$ may satisfy the conditions above. However, once the initial workload is at 0, the tasks of some class, denote it for a
moment by $j$, are accumulated first. This class has the lowest $h_j/\theta_j$. That is, the first type of task to be accumulated
in the buffer follows the well-known $c\mu$ rule, until $w = b\theta_j$. We derive next the rule for finding the two task types
which minimize $\bar h$ for $w > b\theta_j$. Note that the maximal workload associated with tasks of (the cheapest) type $j$ is
the lowest workload possible provided the buffer is full. Now assume that an additional amount of workload $\epsilon$
is added. This workload addition is translated into a reduction of the number of tasks of type $j$ and an addition of tasks of
other types. Denote the number of tasks subtracted from type $j$ by $\chi$. Then, the number of added tasks of all other
types is $\sum_{i=1, i \ne j}^{I} c_i\chi$, for non-negative $c_i$ such that $\sum_{i=1, i \ne j}^{I} c_i = 1$. The total workload change is given by

$$ \epsilon = \sum_{i=1, i \ne j}^{I} c_i\chi\theta_i - \chi\theta_j. $$

Then, the total number of tasks of type $j$ that were displaced (by displaced we mean served, with no additional
tasks of that type admitted) is

$$ \chi = \frac{\epsilon}{\sum_{i=1, i \ne j}^{I} c_i\theta_i - \theta_j}. $$

We find the optimal cost associated with the workload addition $\epsilon$, and denote it by $P$:

$$ P = \sum_{i=1, i \ne j}^{I} c_i\chi h_i - \chi h_j = \epsilon\, \frac{\sum_{i=1, i \ne j}^{I} c_i h_i - h_j}{\sum_{i=1, i \ne j}^{I} c_i\theta_i - \theta_j}. $$

Alternatively, the cost addition per unit of added workload is

$$ P/\epsilon = \frac{\sum_{i=1, i \ne j}^{I} c_i h_i - h_j}{\sum_{i=1, i \ne j}^{I} c_i\theta_i - \theta_j}. \tag{41} $$

The following lemma is a straightforward consequence of Theorem 3.4.1 mentioned above:


Lemma 4.1: The optimal combination of $c_i$ with respect to (41) is such that there exists one $c_k = 1$, $k \ne j$, and $c_i = 0$
for $i \ne k$.

Observe that once the buffer is full with tasks of some type $j$, only tasks of the type $k$ with the lowest ratio
$(h_k - h_j)/(\theta_k - \theta_j)$ are admitted. That is, the additional cost accumulated due to the workload increase is minimal.
Heuristically, as the workload continues to increase, this admission pattern continues until the buffer is full with tasks
of type $k$, while all tasks of type $j$ have been displaced. Then, the tasks of the next type which satisfies the analogous
condition are accumulated, instead of those of type $k$. The workload cannot grow further once the free boundary is
reached. Note that the first class to be admitted (when the buffer is not full) is the one with the lowest ratio $h_i/\theta_i$. We
refer to this heuristically described procedure as the order of accumulation. Figure 1 demonstrates the description above,
in 4 possible cases, for systems with 3 and 2 classes.
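
The vertex property stated in Lemma 4.1 can be checked numerically. The following is a minimal sketch, not part of the original analysis: it evaluates the ratio $P/\varepsilon$ from (41) for mixtures of two candidate classes, using the holding costs and service rates of the numerical example in Section 4.1.1 with $\theta_i = 1/\mu_i$. The function name `marginal_cost` and the grid of mixtures are illustrative assumptions.

```python
def marginal_cost(c, h, theta, j):
    """Extra holding cost per unit of added workload, P/eps as in (41),
    when class j is displaced by the mixture c (the entry c[j] is ignored)."""
    others = [i for i in range(len(h)) if i != j]
    num = sum(c[i] * h[i] for i in others) - h[j]
    den = sum(c[i] * theta[i] for i in others) - theta[j]
    if den <= 0:
        raise ValueError("the mixture must raise the workload per task")
    return num / den


# Parameters of the numerical example (classes I, II, III); theta_i = 1/mu_i.
h = [1390.0, 1050.0, 733.0]
theta = [1 / 1.8, 1 / 2.2, 1 / 2.8]
j = 2  # the buffer is currently full of class III

# Mixtures c = (t, 1 - t, 0) of classes I and II on an illustrative grid.
grid = [k / 20 for k in range(21)]
costs = [marginal_cost([t, 1 - t, 0.0], h, theta, j) for t in grid]
best_t = grid[costs.index(min(costs))]  # weight put on class I at the minimum
```

On this grid the ratio is monotone in the mixture weight, so the minimum sits at an endpoint (here, admitting class II only), which is the behaviour the lemma describes.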


Remark 4.2: For more consistency, one can define $\theta_0 = 0$ and $h_0 = 0$. Then, the expression $(h_k - h_j)/(\theta_k - \theta_j)$ holds when the
buffer is not full as well.


We use these results to give a formal definition for the AO policy in the next section. 


4.1.1 Numerical example of Harrison-Taksar problem and its solution. 

Fig. 1: The case of 2 sharing classes (right), and 3 sharing classes (left). Rejection point may be reached before the
class with least priority is admitted. For a higher rejection level, the curve continues along the boundary.

In what follows we present a numerical example. The graphs below demonstrate the numerical solution of the Harrison-
Taksar problem for a shared buffer with maximal occupancy given by $b = 125$ tasks, and other parameters given in
Table 1.

Tab. 1: Table of parameters of 3 sharing tasks

Class       Holding cost $h_i$   Rejection cost $r_i$   $\mu_i$   $\lambda_i$   $r_i\mu_i$
Class I     1390                 962.5                  1.80      0.60          1732.4
Class II    1050                 700                    2.20      0.73          1539.1
Class III   733                  875                    2.80      0.93          2450.3

We assumed $\hat\lambda_i = \hat\mu_i = 0$ for all $i$ (so that $\hat m = 0$), that $\sigma^2 = 0.91$, and took the discount parameter $\alpha = 10$.
The ordering of $r_i\mu_i$ is such that class II is the least expensive as far as rejections are concerned. Thus, this class is
rejected at the free boundary.
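
The derived quantities used in this example can be recomputed from the table. The sketch below is illustrative only: it assumes $\theta_i = 1/\mu_i$ and that the class rejected at the free boundary is the one minimizing $r_i\mu_i$, as stated above. The dictionary names are hypothetical, and the products computed from the rounded rates differ slightly from the last column of the table.

```python
# Service rates and rejection costs as listed in Table 1.
mu = {"I": 1.80, "II": 2.20, "III": 2.80}
r = {"I": 962.5, "II": 700.0, "III": 875.0}

# Per-task workload contribution theta_i = 1 / mu_i.
theta = {k: 1.0 / m for k, m in mu.items()}

# Rejection cost per unit of workload, r_i * mu_i.
r_mu = {k: r[k] * mu[k] for k in mu}

# The class with the smallest r_i * mu_i is the cheapest one to reject.
rejected = min(r_mu, key=r_mu.get)
```

Rounded to two decimals, the values of `theta` are the constants 0.56, 0.45 and 0.36 that appear in the expressions for $h(w)$ and $\gamma(w)$.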

The Bellman equation takes the form

$$\begin{cases} [2.19 f'' - 10 f + h] \wedge f' \wedge [1539.19 - f'] = 0, & \text{in } (0, 69.45), \\ f'(0) = 0, \quad f'(69.45) = 1539.19. \end{cases} \tag{42}$$


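The function $h$ is defined by

$$h(w) = \min\Big\{ \sum_{i=1}^{3} h_i \xi_i : \xi \in X, \ \sum_{i=1}^{3} \theta_i \xi_i = w \Big\}, \quad w \in [0, 69.45],$$

where $X = [0,125]$ and $\theta_i = \mu_i^{-1}$, which by numerical solution of the corresponding LP is translated into the following

$$h(w) \approx \begin{cases}
\dfrac{732.9 \cdot w}{0.36}, & 0 < w < 44.64, \\[2mm]
\dfrac{732.9\,(125 \cdot 0.45 - w)}{0.45 - 0.36} + \dfrac{1050\,(w - 125 \cdot 0.36)}{0.45 - 0.36}, & 44.64 < w < 56.85, \\[2mm]
\dfrac{1050\,(125 \cdot 0.56 - w)}{0.56 - 0.45} + \dfrac{1390\,(w - 125 \cdot 0.45)}{0.56 - 0.45}, & 56.85 < w < 68.06.
\end{cases}$$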
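
The LP behind $h(w)$ is small enough to be solved by enumerating its vertices. The following sketch is an illustration under stated assumptions, not the numerical procedure used for the results above: it takes the feasible set to be $\{\xi \ge 0, \sum_i \xi_i \le b, \theta \cdot \xi = w\}$, so that every basic solution has either a single class present or two classes present with a full buffer. The function name `h_of_w` is hypothetical.

```python
from itertools import combinations


def h_of_w(w, h, theta, b, tol=1e-9):
    """Return (cost, xi) minimising h.xi subject to xi >= 0,
    sum(xi) <= b and theta.xi = w, by enumerating the LP vertices."""
    n = len(h)
    best = None
    # Vertices with a single class present (the buffer need not be full).
    for i in range(n):
        xi_i = w / theta[i]
        if xi_i <= b + tol:
            xi = [0.0] * n
            xi[i] = xi_i
            if best is None or h[i] * xi_i < best[0]:
                best = (h[i] * xi_i, xi)
    # Vertices with two classes present and a full buffer.
    for i, k in combinations(range(n), 2):
        if abs(theta[k] - theta[i]) < tol:
            continue
        xi_k = (w - b * theta[i]) / (theta[k] - theta[i])
        xi_i = b - xi_k
        if xi_i >= -tol and xi_k >= -tol:
            xi = [0.0] * n
            xi[i], xi[k] = max(xi_i, 0.0), max(xi_k, 0.0)
            cost = h[i] * xi[i] + h[k] * xi[k]
            if best is None or cost < best[0]:
                best = (cost, xi)
    if best is None:
        raise ValueError("workload not attainable within the buffer")
    return best


# Example parameters (classes I, II, III) with theta_i = 1/mu_i and b = 125.
h_costs = [1390.0, 1050.0, 733.0]
theta = [1 / 1.8, 1 / 2.2, 1 / 2.8]
cost_30, xi_30 = h_of_w(30.0, h_costs, theta, 125.0)
cost_50, xi_50 = h_of_w(50.0, h_costs, theta, 125.0)
```

With unrounded $\theta_i$ the minimizer at $w = 30$ holds class III only, and at $w = 50$ it holds classes II and III with a full buffer, in line with the first two branches of the piecewise expression.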


Table 2 demonstrates the solution to $h$ and the selection of classes for each of the workload intervals. This selection
takes part in setting $\gamma$. Note that we used Remark 4.2.

$j$    $\frac{h_1 - h_j}{\theta_1 - \theta_j}$    $\frac{h_2 - h_j}{\theta_2 - \theta_j}$    $\frac{h_3 - h_j}{\theta_3 - \theta_j}$
0      2501.8      2308.7      2052.4*
3      3310.3      3245.6*
2      3373.0*

Tab. 2: Order of task accumulation; $j$ stands for the class which, in case the workload grows, fills up the buffer.
The optimal (minimal) selection within each row is marked with an asterisk. As long as the buffer is not full, only
class III is admitted. Next, tasks of class II are admitted. As the workload grows, tasks of class III are displaced,
while tasks of class II fill up the buffer. Finally, tasks of class I are accumulated, until the free boundary is reached.
We put blanks where the classes were already chosen for lower workload intervals.



Fig. 2: $V$ and $V'$ within the free boundary. See that the maximal value of $V'$ is equal to $r$ at the free boundary.


The optimal curve $\gamma$ is given by the following

$$\gamma(w) \approx \begin{cases}
\dfrac{125\, w}{44.64}\,[0,0,1], & 0 < w < 44.64, \\[2mm]
\dfrac{125 \cdot 0.45 - w}{0.45 - 0.36}\,[0,0,1] + \dfrac{w - 125 \cdot 0.36}{0.45 - 0.36}\,[0,1,0], & 44.64 < w < 56.85, \\[2mm]
\dfrac{125 \cdot 0.56 - w}{0.56 - 0.45}\,[0,1,0] + \dfrac{w - 125 \cdot 0.45}{0.56 - 0.45}\,[1,0,0], & 56.85 < w < 68.06.
\end{cases}$$

Fig. 3: Classes which form the optimal curve $\gamma$. The X-axis stands for the workload, while the Y-axis shows the number of
tasks of each class present in the buffer at this workload. The optimal queue-lengths at the free boundary are
{0, 27.5595, 97.4405} (dashed vertical line).
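
The mapping from a workload level to the queue lengths on the curve can be sketched in a few lines. This is a hedged illustration rather than the code used for Fig. 3: the function name `gamma_curve` is hypothetical, the accumulation order is assumed to be given (class III, then II, then I for the example), and $\theta_i = 1/\mu_i$ is used without rounding.

```python
def gamma_curve(w, theta, order, b):
    """Queue-length vector on the minimizing curve at workload w.

    `order` lists class indices in their order of accumulation; theta is
    assumed to increase along this order.
    """
    x = [0.0] * len(theta)
    first = order[0]
    if w <= b * theta[first]:
        # Buffer not yet full: only the first class is present.
        x[first] = w / theta[first]
        return x
    for lo, hi in zip(order, order[1:]):
        if w <= b * theta[hi]:
            # Full buffer shared by two consecutive classes.
            x[hi] = (w - b * theta[lo]) / (theta[hi] - theta[lo])
            x[lo] = b - x[hi]
            return x
    raise ValueError("workload exceeds b * max(theta)")


theta = [1 / 1.8, 1 / 2.2, 1 / 2.8]   # classes I, II, III
order = [2, 1, 0]                     # III first, then II, then I
g30 = gamma_curve(30.0, theta, order, 125.0)
g50 = gamma_curve(50.0, theta, order, 125.0)
g60 = gamma_curve(60.0, theta, order, 125.0)
```

In each branch the returned vector reproduces the workload, that is, $\theta \cdot \gamma(w) = w$, and the buffer is full whenever two classes are present.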


5 A nearly optimal policy 

We introduce the curve $\gamma$ for the triangular domain, given by (36). The parameter $x$ associated with the RBCP is
given by $\theta_m b$, where $m = \arg\max_i \theta_i$. This corresponds to the buffer being full with the most processor-consuming tasks.
The tasks of this class contribute the maximal per-task workload. In what follows, we label the classes according to
their order of accumulation, which was heuristically described in the previous section. Again, assume that the workload
grows from 0 to $x$. This ordering forms the accumulation priorities $p(j)$, and it will be used to formulate the
mapping from $w$ to $\gamma(w)$. The numbering $j \in \{1, \ldots, I\}$ refers to the workload points $w_j$ at which the buffer is entirely
full with tasks of some type $i = p(j)$, such that $w_j < w_{j+1}$. (Note that in the sequel we redefine this ordering in order
to obtain $i = p(i)$.) The point $w_0 = 0$ stands for zero workload, and $w_1$ is defined as in the previous subsection, i.e. the
minimal load when the buffer is full. Namely,

$$w_1 = \theta_n b, \qquad n = \arg\min_i h_i \mu_i, \quad i \in \{1, \ldots, I\}.$$

The calculation of $p(j)$ for $j > 1$ is performed recursively:

$$p(j) = \arg\min_i \Big\{ \frac{h_i - h_{p(j-1)}}{\theta_i - \theta_{p(j-1)}} : h_i > h_{p(j-1)}, \ \theta_i > \theta_{p(j-1)} \Big\}.$$

Observe that the restriction $h_i > h_{p(j-1)}$, $\theta_i > \theta_{p(j-1)}$ means that no class can be chosen twice (for otherwise that
$i$ would have been chosen for a lower $j$). In addition, the condition on $\theta_i$ assures that the workload added by each task
of class $p(j)$ is higher than that of a task of class $p(j-1)$. Observe that $|p(j)| \le I$. Note that some classes can never be
optimal (for example, those with comparatively high $h_i$ and low $\theta_i$) and thus are never accumulated. However, the
tasks of the class with maximal $\theta_i$ are always assigned to the last $j$. Note that by Remark 4.2 one can define $h_{p(0)} = 0$
and $\theta_{p(0)} = 0$ to have the order of accumulation start with an empty buffer.
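
The recursion for $p(j)$ can be written out directly. The sketch below is a plain transcription of the rule above under the stated restrictions; the function name `accumulation_order` and the fourth class in the usage example are hypothetical and serve only to show a class that is never accumulated.

```python
def accumulation_order(h, theta):
    """Return (order, never): the classes in order of accumulation p(1), p(2), ...
    and the classes that are never accumulated in the buffer."""
    n = len(h)
    # p(1): the class with the lowest h_i / theta_i (equivalently h_i * mu_i).
    order = [min(range(n), key=lambda i: h[i] / theta[i])]
    while True:
        prev = order[-1]
        # Candidates must be costlier to hold and heavier per task than p(j-1).
        cands = [i for i in range(n)
                 if h[i] > h[prev] and theta[i] > theta[prev]]
        if not cands:
            break
        order.append(min(
            cands,
            key=lambda i: (h[i] - h[prev]) / (theta[i] - theta[prev])))
    never = [i for i in range(n) if i not in order]
    return order, never


# Classes I, II, III of the numerical example (indices 0, 1, 2) ...
h = [1390.0, 1050.0, 733.0]
theta = [1 / 1.8, 1 / 2.2, 1 / 2.8]
order3, never3 = accumulation_order(h, theta)

# ... plus a hypothetical fourth class with a high holding cost and low theta.
order4, never4 = accumulation_order(h + [2000.0], theta + [0.3])
```

For the three example classes the sketch returns the order III, II, I; the added hypothetical class fails the restriction on $\theta_i$ at every step and therefore lands in the second list.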








Define two groups of tasks:

$$\mathcal{E} = \{ i \mid \exists j : p(j) = i \}, \qquad \mathcal{D} = \{ i \mid \nexists j : p(j) = i \}.$$

Denote $p_m = |\mathcal{E}|$. The group $\mathcal{D}$ constitutes the task types which are always at low accumulation priority. Observe that
$\mathcal{D}$ can be an empty set. Next, for simplicity of notation, we reorder the class indexing such that $i = p(i)$ if $i \in \mathcal{E}$,
and $i \in \{p_m + 1, p_m + 2, \ldots, I\}$ if $i \in \mathcal{D}$. That is, the tasks are rearranged in the order of their accumulation in
the buffer (i.e. the order of $i$ is equivalent to the order of $j$ before the reordering). Denote $J = |\mathcal{E}|$. The workload
intervals are denoted by $\omega_j$:

$$\omega_j = [w_{j-1}, w_j), \quad j \ge 1. \tag{43}$$

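Given $w \in [0, x]$, $(j, \xi_l, \xi_h) = (j, \xi_l, \xi_h)(w)$ are determined by $w \in \omega_j$, and we use the minimization defining $h(w)$ to set

$$\xi_h(w) = \frac{w - b\theta_{j-1}}{\theta_j - \theta_{j-1}}, \qquad \xi_l(w) = b - \xi_h(w), \tag{44}$$

in case $j > 1$, and

$$\xi_l(w) = 0, \qquad \xi_h(w) = w/\theta_1, \tag{45}$$

in case $j = 1$. With this notation, $\gamma$ is given by

$$\gamma(w) = \xi_h e_j + \xi_l e_{j-1}.$$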

Clearly, the curve expressed by $\gamma$ defined above can lie along the boundary $\partial X$ of $X$, namely along
$\partial^+ X := \{x \in X : \sum_i x_i = b\}$, i.e. where the buffer is full. Note that $\gamma$ is the solution to the BCP which, in addition,
is consistent with the free boundary solution found from the RBCP, according to which only rejections at the
workload level $x^*$ are allowed. These two properties of the solution can be seen as contradictory when treating
the QCP. Hence, we propose a policy which approximates $\gamma$ by introducing an alternative curve, which is close to
$\gamma$ and is bounded away from the buffer limit boundary at the same time. Let $\varepsilon \in (0, b)$ be given. Let $a = b - \varepsilon$, and
$a^* := x^* \wedge (\theta_J a) < x = \theta_J b$. In the case $\varepsilon$ is small, we then have $a^* = x^*$ (unless $x^* = x$). Note that the buffer can
be full for various values of $\xi_l, \xi_h$. Therefore, we define additional margins that can vary according to the workload
$w$. Let $\varepsilon_l = \varepsilon_l(w)$ and $\varepsilon_h = \varepsilon_h(w)$. For each $w$ we set $\varepsilon_l$ and $\varepsilon_h$ as follows

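$$\varepsilon_l = \begin{cases}
\min\{\varepsilon/2, \ \xi_l\}, & \text{if } \xi_l \le \xi_h, \\
\varepsilon/2, & \text{if } \xi_l > \xi_h, \ \xi_h \ge \varepsilon/2, \\
\varepsilon - \xi_h, & \text{if } \xi_l > \xi_h, \ \xi_h < \varepsilon/2,
\end{cases}
\qquad \varepsilon_h = \varepsilon - \varepsilon_l. \tag{46}$$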

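Let $\chi_l = \xi_l - \varepsilon_l$ and $\chi_h = \xi_h - \varepsilon_h$. The approximation $\gamma^a : [0, x] \to X$ of $\gamma$ is as follows. For $w \in [0, \theta_J a)$, the
variables $j = j(w)$ and $\chi_l = \chi_l(w)$, $\chi_h = \chi_h(w)$ are determined via

$$w = \theta_{j-1}\chi_l + \theta_j \chi_h, \quad j \in \{1, 2, \ldots, J\}, \quad \chi_l + \chi_h \in [0, a), \tag{47}$$

and

$$\gamma^a(w) = \chi_l e_{j-1} + \chi_h e_j. \tag{48}$$

Note that the triplet $(j, \chi_l, \chi_h)$ is unique. We will refer to it as the representation $(j, \chi_l, \chi_h)$ of $w$ via (47). Denote
$w_\chi = \theta_l \chi_l + \theta_h \chi_h$ and $w_\xi = \theta_l \xi_l + \theta_h \xi_h$, where $\theta_l$ and $\theta_h$ refer to the classes which are associated with $\xi_l$
and $\xi_h$, correspondingly. We need the function $\gamma^a$ to be continuous on $[w_\chi, w_\xi]$ and to satisfy the relation $\theta \cdot \gamma^a(w) = w$. Hence,
we define the linear interpolation between the points $w_\chi$ and $w_\xi$ as follows:

$$\gamma^a(w) = a + \frac{w - w_\chi}{w_\xi - w_\chi}\,(b - a), \quad w \in [w_\chi, w_\xi]. \tag{49}$$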

We next specify the policy. $Z^n(t)$ defines the rejection policy, while $B^n(t)$ describes the server allocation policy,
as a function of $X^n(t)$. In what follows the policy is set considering the index correction after the reordering above.

Rejection policy:

The multidimensional rejection process $Z$ has only one nonzero component, namely the $i^*$-th component, which
increases only when $X \ge x^*$. This structure is translated to the queueing model, and we use it to construct the
rejection component of the asymptotically optimal policy: arrivals of class $i^*$ are rejected when $\theta \cdot X^n \ge a^*$. Forced rejections
occur only in order to keep the buffer size constraint.


Service policy: 


For each $x \in X$ define the low priority classes:

$$\mathcal{L}(x) = \{ j \mid x_j < \chi_h, \ \ j - 1 \mid x_{j-1} < \chi_l \}.$$

That is, the tasks of type $j$ and $j - 1$, where $j = j(w)$, always have low priority as long as their quantity is below $\chi_h$
and $\chi_l$, correspondingly. The complement set defines the high priority classes:

$$\mathcal{H}(x) := \mathcal{I} \setminus \mathcal{L}(x).$$

Denote $\mathcal{H}^+(x) = \{i \in \mathcal{H}(x) : x_i > 0\}$ and $\mathcal{L}^+(x) = \{i \in \mathcal{L}(x) : x_i > 0\}$. The policy is to allocate service to all classes
within $\mathcal{H}^+(x)$, each receiving a fraction proportional to its traffic intensity. Classes within $\mathcal{L}(x)$ receive no
service, with the exception of the moments when $\mathcal{H}^+(x)$ is empty.


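$$\rho_i^{\mathcal{L}}(x) = \begin{cases}
0, & \text{if } x_i = 0 \text{ or } \mathcal{H}^+(x) \neq \emptyset, \\[1mm]
\dfrac{\rho_i 1_{\{i \in \mathcal{L}^+(x)\}}}{\sum_{k \in \mathcal{L}^+(x)} \rho_k}, & \text{if } \mathcal{H}^+(x) = \emptyset,
\end{cases}
\qquad
\rho_i^{\mathcal{H}}(x) = \begin{cases}
0, & \text{if } x_i = 0, \\[1mm]
\dfrac{\rho_i 1_{\{i \in \mathcal{H}^+(x)\}}}{\sum_{k \in \mathcal{H}^+(x)} \rho_k}, & \text{if } \mathcal{H}^+(x) \neq \emptyset.
\end{cases} \tag{50}$$

Define $\rho_i'(x) = \rho_i^{\mathcal{L}}(x) \cdot 1_{\{i \in \mathcal{L}\}} + \rho_i^{\mathcal{H}}(x) \cdot 1_{\{i \in \mathcal{H}\}}$. Then for each $t$,

$$B^n(t) = \rho'(X^n(t)). \tag{51}$$

Note that when $\mathcal{H}^+(x) \neq \emptyset$,

$$\rho_i'(x) > \rho_i \quad \text{for all } i \in \mathcal{H}^+(x). \tag{52}$$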


That is, all classes which receive service are allocated a fraction of effort strictly greater than their traffic intensity.
Observe that $\sum_i B_i^n = 1$ whenever $X^n$ is nonzero. Hence, the proposed policy is work conserving. Figure 4 demonstrates
two cases of the service policy for 6 classes. The reader may also wish to go back to Table 2 to review the example
of the order of accumulation.
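
The allocation rule can be illustrated with a short sketch. It is a hedged reading of the service policy, not a definitive implementation: classes are indexed from 0, the index `j` of the currently accumulating class and the thresholds `chi_l`, `chi_h` are taken as given inputs, and the function name `service_allocation` is hypothetical.

```python
def service_allocation(x, rho, j, chi_l, chi_h):
    """Fractions of server effort per class for queue lengths x.

    Classes j and j-1 have low priority while below chi_h and chi_l
    respectively; all other classes have high priority.
    """
    n = len(x)
    low = set()
    if x[j] < chi_h:
        low.add(j)
    if j >= 1 and x[j - 1] < chi_l:
        low.add(j - 1)
    high = set(range(n)) - low
    high_pos = sorted(i for i in high if x[i] > 0)
    low_pos = sorted(i for i in low if x[i] > 0)
    # Serve nonempty high priority classes; fall back to low priority ones.
    served = high_pos if high_pos else low_pos
    if not served:
        return [0.0] * n  # empty system: nothing to serve
    total = sum(rho[i] for i in served)
    return [rho[i] / total if i in served else 0.0 for i in range(n)]


rho = [0.25, 0.25, 0.25, 0.25]  # illustrative traffic intensities
alloc_a = service_allocation([0, 3, 5, 1], rho, 2, 4.0, 6.0)
alloc_b = service_allocation([0, 3, 5, 0], rho, 2, 4.0, 6.0)
alloc_c = service_allocation([0, 5, 7, 0], rho, 2, 4.0, 6.0)
```

In every nonempty state the fractions sum to one, which is the work conservation property, and each served class receives more than its own traffic intensity whenever some class is left unserved.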


Fig. 4: Schematic example of service allocation. The figures depict possible states $X^n(t) = x$. Classes denoted by S
are being served, classes denoted by N are not being served.


Remark 5.3: The tasks of type $i > J$ (if they exist) always have high priority once they arrive in the system. This is because
it is never optimal to accumulate these tasks in the buffer, according to the solution of $h$. However, one
of these task types may be chosen to be rejected at the free boundary, according to $r$.

Remark 5.4: Note that the rejection policy resembles that of the rectangular case, because the calculation of $r$ follows
the same rule. The class $i^*$ is the only class which is subject to non-forced rejection.

6 State space collapse.


In this section, we prove the main theorem, which states the upper bound for the value function under the proposed
policy. The theorem reads as follows.

Theorem 6.1: For each $\varepsilon > 0$ and $n$, denote the policy constructed above by $U^n(\varepsilon)$.

Then,

$$\limsup_{n \to \infty} J^n(U^n(\varepsilon)) \le V(x_0) + \sigma(\varepsilon), \quad \text{where } \sigma(\varepsilon) \to 0 \text{ as } \varepsilon \to 0.$$

Remark 6.5: The combination of Theorem 6.1 above and Theorem 3.1 in [3] (which states the general lower bound)
provides the AO of the policy $U^n(\varepsilon)$.


Proof. 

For fixed $\varepsilon$ in $U^n(\varepsilon)$, write $U^n = (Z^n, B^n)$ and denote by $\tau^n$ the time of the first forced rejection. We perform
most of the analysis on the processes before $\tau^n$ is reached. We show that the cost of the QCP, under the proposed policy,
weakly converges to the value function associated with the BCP solution as $n \to \infty$. We divide the proof into two major
steps. The first step shows that the workload process $\theta^n \cdot X^n$ converges to an RBM. As a result, only rejections from
class $i^*$ occur, and only when $\theta^n \cdot X^n \approx a^*$. The second step shows that $X^n$ lies close to the minimizing curve at all
times. Hence, the running cost is locally minimized. This establishes that in any finite time, $\tau^n$ is not reached.

We prove the result for the case where the system starts with an initial condition close to the minimizing curve, namely

$$X^n(0) - \gamma^a(\theta \cdot X^n(0)) \to 0 \text{ as } n \to \infty, \quad \text{and } \theta^n \cdot X^n(0) \in [0, a^*] \text{ for all } n \text{ large}. \tag{53}$$

The assumption on the initial condition can be relaxed in a straightforward manner. In short, given a general initial
condition, there will be a jump towards a position located on the curve. We skip the technical details, which can be
found in [3].

Denote by $\varepsilon'$ a distance from the minimizing curve $\gamma^a(\theta \cdot x)$, small enough to assure that forced rejections
within that distance do not occur. Observe that $\sigma^n \le \tau^n$, where

$$\sigma^n = \zeta^n \wedge \hat\zeta^n,$$

$$\hat\zeta^n = \inf\{t : X^{\#,n}(t) \ge a^* + \varepsilon'\}, \qquad \zeta^n = \inf\{t : \max_{i \le I} |\Delta_i^n(t)| \ge \varepsilon'\}.$$

In this sense, $\hat\zeta^n$ refers to the time of violating the free boundary of the workload, while $\zeta^n$ refers to the time when the
distance of the number of tasks of any task type from the optimality curve exceeds $\varepsilon'$.

Now multiply the state equation for $X^n$ by the vector $\theta^n = (1/\mu_i^n)_{i \in \mathcal{I}}$ and denote

$$W^{\#,n} = \theta^n \cdot W^n, \quad X^{\#,n} = \theta^n \cdot X^n, \quad Y^{\#,n} = \theta^n \cdot Y^n, \quad Z^{\#,n} = \theta^n \cdot Z^n. \tag{54}$$

We have

$$X^{\#,n} = X^{\#,n}(0) + W^{\#,n} + Y^{\#,n} - Z^{\#,n}. \tag{55}$$

Let $W^{0,n} := W^{\#,n}(\cdot \wedge \tau^n)$ denote the process $W^{\#,n}$ when stopped at the time $\tau^n$. Define similarly $X^{0,n}$, $Y^{0,n}$ and
$Z^{0,n}$.

The following lemma states that the free boundary is not violated (that is, $\hat\zeta^n$ is not reached).

Lemma 6.2: The sequence $(W^{0,n}, X^{0,n}, Y^{0,n}, Z^{0,n})$ is C-tight, and any subsequential limit $(W, X, Y, Z)$ satisfies, a.s.,

$$(X, Y, Z) = \Gamma_{[0, a^*]}[x_0 + W]. \tag{56}$$


The detailed proof is given in [3] (see Theorem 4.1, Step 1).

Hence, we proceed directly to prove our main result, which constitutes the proof of the state-space collapse.
Our objective is to show that the multidimensional process $X^n$ lies close to the minimizing curve. That is, as $n \to \infty$,

$$\Delta^n(t) := X^n(t) - \gamma^a(X^{\#,n}(t)) \Rightarrow 0, \tag{57}$$

uniformly on compacts.

It suffices to show that $P(\sigma^n < T) \to 0$, for any small $\varepsilon' > 0$ and any $T$. Fix $\varepsilon'$ and $T$. Thanks to the fact that
$\sigma^n \le \tau^n$,

$$P(\sigma^n < T) \le P(\zeta^n \wedge \hat\zeta^n < T \wedge \tau^n) \le P(\zeta^n < T \wedge \tau^n) + P(\hat\zeta^n < T \wedge \tau^n). \tag{58}$$

From Lemma 6.2 it follows that $P(\hat\zeta^n < T \wedge \tau^n) \to 0$ as $n \to \infty$. It therefore suffices to prove the following lemma.

Lemma 6.3: $P(\zeta^n < T \wedge \tau^n) \to 0$ as $n \to \infty$.

Proof. 

On $\{\zeta^n < T \wedge \tau^n\}$ let $x^n := X^{\#,n}(\zeta^n) = \theta^n \cdot X^n(\zeta^n)$, and let $j = j^n$ and $\chi_l^n, \chi_h^n$ be the corresponding components
from the representation $(j, \chi_l, \chi_h)$ of $x^n$ (with $w = x^n$).

The proof strategy is to define a covering of the workload domain $[0, x]$ by intervals. Then, we assume $w$ is
located in one of these intervals and prove the lemma for all possible cases. More precisely, we show that it is sufficient
to distinguish between four different types of intervals in order to treat all possibilities.

To that end, fix a positive integer $K = K(\varepsilon') = [c_0/\varepsilon']$, where $c_0$ is a constant depending only on $\theta$ (we treat
this value at a later stage of the proof). Define $K - 1$ intervals $\Xi_k = B(k\varepsilon_1, \varepsilon_1)$, $k = 1, 2, \ldots, K - 1$, where $B(x, d)$
denotes a closed interval $[x - d, x + d]$ and $\varepsilon_1 = x/K$. Let $\tilde\Xi_k = B(k\varepsilon_1, 2\varepsilon_1)$.
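
The covering is simple to construct explicitly. The sketch below is illustrative only: it reads the bracket in $K = [c_0/\varepsilon']$ as a ceiling, which is an assumption, and the function names `covering` and `covering_indices` as well as the numerical values are hypothetical.

```python
import math


def covering(x, eps_prime, c0):
    """Build the K - 1 inner intervals B(k*eps1, eps1) and the enlarged
    intervals B(k*eps1, 2*eps1) covering the workload domain [0, x]."""
    K = math.ceil(c0 / eps_prime)  # assumption: [.] is read as a ceiling
    eps1 = x / K
    inner = [(k * eps1 - eps1, k * eps1 + eps1) for k in range(1, K)]
    outer = [(k * eps1 - 2 * eps1, k * eps1 + 2 * eps1) for k in range(1, K)]
    return K, eps1, inner, outer


def covering_indices(w, eps1, K):
    """Indices k whose inner interval B(k*eps1, eps1) contains w."""
    return [k for k in range(1, K) if abs(w - k * eps1) <= eps1]


K, eps1, inner, outer = covering(10.0, 2.0, 40.0)
hits = covering_indices(3.2, eps1, K)
```

Consecutive inner intervals overlap by $\varepsilon_1$, so every workload level in $[0, x]$ belongs to at least one of them.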

We use the characterization of C-tightness as in Proposition VI.3.26 of Jacod and Shiryaev and apply it to $X^{0,n}$. Given $\delta > 0$
there exists $\delta' = \delta'(\delta, T, \varepsilon_1) > 0$, such that for all sufficiently large $n$,

$$|X^{0,n}(s) - X^{0,n}(t)| \le \varepsilon_1 \text{ for all } s, t \in [0, T], \ |s - t| \le \delta', \text{ with probability at least } 1 - \delta. \tag{59}$$

Fix such $\delta$ and $\delta'$. Denote by $T^n$ the interval $[(\zeta^n - \delta') \vee 0, \ \zeta^n]$. Define the following event,

$$\Omega^{n,k} = \{\zeta^n < T \wedge \tau^n, \ x^n \in \Xi_k, \ X^{\#,n}(t) \in \tilde\Xi_k \text{ for all } t \in T^n\}. \tag{60}$$

By simple probabilistic manipulations, using that $X^{\#,n} = X^{0,n}$ on $[0, \tau^n]$, it can be shown that for all large $n$,

$$P(\zeta^n < T \wedge \tau^n) \le \delta + \sum_k P(\Omega^{n,k}). \tag{61}$$

We fix the index $k$, which is associated with the $k$-th interval in the workload covering, and analyze $\Omega^{n,k}$. We aim
to show that $P(\Omega^{n,k}) \to 0$ as $n \to \infty$, for each $k$. The value assigned by the policy to $B^n$ (see (51)) remains fixed as
$X^n$ varies within any of the intervals $\omega_j$ defined in (43). However, the workload need not stay within a single such interval. There are several
cases, which we define and treat separately below.


(I)

$\tilde\Xi_k \subset (0, a^*)$ and for all $j$, $w_j \notin \tilde\Xi_k$. That is, $X^{\#,n}$ remains in the same interval during the time window $T^n$. Hence,
the region $\omega_j$ is constant. To prove the claim in this case, observe that all points $x$ in $\Xi_k$ are mapped to the same $j$ in the
representation $(j, \chi_l, \chi_h)$ of $x$, as in (47). Note that $j = j(k)$ depends on the interval $k$ only, and does not vary with
$n$. Also, $j = j^n$ on $\Omega^{n,k}$. We split the proof into two separate cases corresponding to the two groups of classes,
$i \notin \{j - 1, j\}$ and $i \in \{j - 1, j\}$. We also separately treat the case where $j = 1$, i.e. the buffer is not full. We start with the
first group. Fix $i \notin \{j - 1, j\}$. We estimate the probability that, on $\Omega^{n,k}$, $\zeta^n < T \wedge \tau^n$ occurs by having $\Delta_i^n(\zeta^n) > \varepsilon'$.
More precisely, note that $\gamma_i^a(x^n) = 0$ (this follows from the policy definition in (50), i.e. because $i \notin \{j - 1, j\}$). Then
we will show that

$$\text{for every } \varepsilon'' \in (0, \varepsilon'), \quad P(\Omega^{n,k} \cap \{\Delta_i^n(\zeta^n) > \varepsilon''\}) \to 0 \text{ as } n \to \infty. \tag{62}$$

Note that $\gamma^a$ is continuous and that $\Delta^n(0) \to 0$ as $n \to \infty$, by (53). Using the fact that the jumps of $X^n$ are of size
$n^{-1/2}$, on the event indicated in (62) there must exist $\eta^n \in [0, \zeta^n]$ with the properties that

$$X_i^n(\eta^n) < \varepsilon''/2, \qquad X_i^n(t) > 0 \text{ for all } t \in [\eta^n, \zeta^n]. \tag{63}$$

Define $\hat\eta^n = \eta^n \vee (\zeta^n - \delta')$. We now examine $X_i^n(\zeta^n) - X_i^n(\hat\eta^n) = X_i^n[\hat\eta^n, \zeta^n]$, aiming to bound it using the modulus
of continuity. Since the probability of the latter being positive should go to zero, the probability that the difference
$X_i^n[\hat\eta^n, \zeta^n]$ is positive will go to zero as well. Observe that due to the position of $X^{\#,n}(t)$ in the interval $\tilde\Xi_k \subset (0, a^*)$ (by definition
of case (I)) there are no rejections at the free boundary at any time in the interval $[\hat\eta^n, \zeta^n]$. Hence, $Z_i^n[\hat\eta^n, \zeta^n] = 0$.
Since the buffer is full, we must show that the probability of forced rejections goes to zero as $n \to \infty$, for all $i$.
Assume that jobs of classes $j - 1$ and $j$ are bounded away from $\xi_l$ and $\xi_h$, respectively. Hence, the arrivals of class $i$
are admitted and $Z_i^n[\hat\eta^n, \zeta^n] = 0$, with high probability. Observe that on the event in (63), during the time interval
$[\hat\eta^n, \zeta^n]$, $i$ is always a member of $\mathcal{H}(X^n)$. Recall that $T_i^n(t) = \int_0^t B_i^n(s)\,ds$, while by (51), (52),
$B_i^n(t) = \rho_i'(X^n(t)) > \rho_i + c$, for some constant $c > 0$. Substitute in (15) and take the time derivative:

$$\frac{d}{dt} Y_i^n(t) \le \frac{d}{dt}\, \frac{\mu_i^n}{\sqrt{n}} \Big( \rho_i t - \int_0^t (\rho_i + c)\, ds \Big) = -\frac{\mu_i^n}{\sqrt{n}}\, c. \tag{64}$$

Note that the definition of $\hat\eta^n$ assures that the equations which follow are valid on $\Omega^{n,k}$, i.e. within the time
window $T^n$. That is, by the choice of $\delta'$ and the definition of $\Omega^{n,k}$, the time interval $[\hat\eta^n, \zeta^n]$ is bounded by $\delta'$, while $X_i^n[\hat\eta^n, \zeta^n]$ on
it is bounded by $\varepsilon_1$. Using these facts in (13) and substituting $(\zeta^n - \hat\eta^n)$ for the time interval in (64), we have

$$X_i^n[\hat\eta^n, \zeta^n] \le W_i^n[\hat\eta^n, \zeta^n] - c\, \frac{\mu_i^n}{\sqrt{n}}\, (\zeta^n - \hat\eta^n). \tag{65}$$

Fix $r_n$, a positive sequence such that $r_n \to 0$ and $r_n \sqrt{n} \to \infty$. We use this sequence to distinguish between two cases
according to the size of $\zeta^n - \hat\eta^n$. The C-tightness of $W_i^n$ provides a bound by means of the modulus of continuity;
that is, $w_T(W_i^n; r_n) \ge W_i^n[\eta^n, \zeta^n]$.

In the first case, we assume $\zeta^n - \eta^n \le r_n$ and $n$ is sufficiently large, such that $\hat\eta^n = \eta^n$. Moreover, $c \frac{\mu_i^n}{\sqrt{n}} (\zeta^n - \hat\eta^n)$
is finite and positive. Thus, by the definition of $\eta^n$, and by the assumption that a displacement of size $\varepsilon''/2$ occurred, we have
$X_i^n[\hat\eta^n, \zeta^n] \ge \varepsilon''/2$. Consequently,

$$w_T(W_i^n; r_n) \ge W_i^n[\eta^n, \zeta^n] \ge \varepsilon''/2$$

must hold. However, the probability of this event goes to zero as $n \to \infty$.

In the second case we assume that $\zeta^n - \hat\eta^n > r_n$. Hence, by (65), omitting $\varepsilon''/2$ and rearranging,

$$2\|W_i^n\|_T \ge W_i^n[\hat\eta^n, \zeta^n] \ge c\, \frac{\mu_i^n}{\sqrt{n}}\, r_n \ge c\, r_n \sqrt{n},$$

for some constant $c > 0$. Observe that since $r_n \sqrt{n} \to \infty$, the probability of the event above goes to zero as well.
Summarizing, the probability in (62) is bounded by

$$P(w_T(W_i^n; r_n) \ge \varepsilon''/2) + P(2\|W_i^n\|_T \ge c\, r_n \sqrt{n}), \tag{66}$$

which converges to zero as $n \to \infty$, by C-tightness of $W^n$. This proves (62). Consequently, the classes which do not
belong to $\{j - 1, j\}$ stay at zero with high probability. Moreover, forced rejections of these classes happen with
low probability.

Note that in the case $i \notin \{j - 1, j\}$ we demonstrated that $X_i^n(t)$ lies near zero. Hence, we could assume that
$\gamma_i^a(X^{0,n}) = 0$. In order to prove the claim for the classes $j - 1$ and $j$ we aim to show,

$$\text{for every } \varepsilon'' \in (0, \varepsilon'), \quad P(\Omega^{n,k} \cap \{\Delta_i^n(\zeta^n) > \varepsilon''\}) \to 0 \text{ as } n \to \infty. \tag{67}$$

That is, classes $j - 1$ and $j$ are bounded away from $\xi_l$ and $\xi_h$ by the predefined constants $\varepsilon_l$ and $\varepsilon_h$, respectively. Therefore,
we will show that the distance from the curve, $\Delta_i^n[\hat\eta^n, \zeta^n]$, goes to zero with high probability. For simplicity of notation,
define $C^n(t) = \gamma^a(X^{0,n}(t))$. Then $\Delta^n = X^n - C^n$. Recall that by the policy definition, for $i = j$,

$$j \in \mathcal{H}(X^n(t)) \text{ whenever } X_j^n(t) \ge \chi_h, \tag{68}$$

$$j \in \mathcal{L}(X^n(t)) \text{ whenever } X_j^n(t) < \chi_h.$$

We treat the first option first. Similarly to (63), there exists $\eta^n \le \zeta^n$ such that

$$X_j^n(\eta^n) < \chi_h + \varepsilon''/2, \qquad X_j^n(t) > \chi_h \text{ for all } t \in [\eta^n, \zeta^n]. \tag{69}$$

Again, we distinguish between two cases: $\zeta^n - \eta^n \le r_n$ and $\zeta^n - \eta^n > r_n$. We now have two modulus-of-continuity
terms, referring to the processes $W_i^n$ and $C^n$. Recall that, by definition, $\gamma^a$ is continuous (see (48), (49)) and by
Lemma 6.2 $X^{0,n}$ is C-tight. Hence, $C^n$ is C-tight. Therefore, arguing as for $i \notin \{j - 1, j\}$, the probability
in (67) is bounded by

$$P(w_T(W_i^n; r_n) + w_T(C_i^n; r_n) \ge \varepsilon''/2) + P(2\|W_i^n\|_T + 2\|C_i^n\|_T \ge c\, r_n \sqrt{n}). \tag{70}$$

Note that in the second option of (68) class $i$ is not served and thus jumps immediately to the point $\chi_h$ with high
probability, as $n \to \infty$. This completes the proof for $i = j$. The proof for $i = j - 1$ is similar, with the only
difference that $\chi_h$ is replaced by $\chi_l$. We now separately treat the case $j = 1$. For this case $j - 1 = 0$ and $\chi_l = 0$,
which is treated as for the task classes $i \notin \{j - 1, j\}$, and (62) holds. The proof for $j$ itself is similar to the case
where $j > 1$, and (67) holds. Thus,

$$\text{for every } \varepsilon'' \in (0, \varepsilon'), \quad P(\Omega^{n,k} \cap \{\Delta_i^n(\zeta^n) > \varepsilon''\}) \to 0 \text{ as } n \to \infty. \tag{71}$$

We can now show that $P(\Omega^{n,k}) \to 0$ as $n \to \infty$. Observe that by the definition of $\gamma$, the minimizing cost, and by the
convergence established for the workload process, $\theta \cdot \gamma^a(\theta \cdot x) = \theta \cdot x$ for all $x \in X$ and $\theta^n \to \theta$. Combining this
with (52), (57), (71), we bound $|\Delta_i^n(\zeta^n)|$ as follows

$$P(\Omega^{n,k} \cap \{\max_{i \le I} |\Delta_i^n(\zeta^n)| > \varepsilon''\}) \to 0. \tag{72}$$

Since $\varepsilon''$ is arbitrarily small, it follows from the definition of $\zeta^n$ that $P(\Omega^{n,k}) \to 0$ as $n \to \infty$.


(II)

We consider the case $\tilde\Xi_k \subset (0, a^*)$ but $w_j \in \tilde\Xi_k$ for some $j \in \{1, 2, \ldots, J\}$. That is, the buffer is mostly filled with
tasks of class $j$. Let $(j^n(t), \chi^n(t))$ denote the representation (47) for $X^{\#,n}(t)$. In the case $j^n > 2$, in the time window
$T^n$, $j^n$ and $(j - 1)^n$ vary in the range of $j - 1$, $j$ and $j + 1$. More precisely, we have to analyze the fluctuation between
$\{j - 1, j\}$ and $\{j, j + 1\}$. The case $j^n = 2$ stands for fluctuation between $\{1\}$ (when the buffer is not nearly full) and $\{1, 2\}$.
In case $j^n = 1$ we have $w_0 \in \tilde\Xi_k$, which is separately treated in case (III).

We first treat the case where $2 < j < J$. In this case, the buffer is mostly filled with tasks of class $j$. Observe by (47)
that if $w \ge w_j$, the tasks of class $j$ are associated with $\chi_l$. Define

$$\chi_h^{(1)}(w) = \frac{w - a\theta_j}{\theta_{j+1} - \theta_j}, \qquad \chi_l^{(1)}(w) = a - \chi_h^{(1)}(w).$$

Otherwise, if $w < w_j$, the tasks of type $j$ are associated with $\chi_h$. Define

$$\chi_h^{(2)}(w) = \frac{w - a\theta_{j-1}}{\theta_j - \theta_{j-1}}, \qquad \chi_l^{(2)}(w) = a - \chi_h^{(2)}(w).$$

Observe that in the first case the buffer is filled with tasks of types $j, j + 1$, while in the second case the buffer is filled
with tasks of types $j - 1, j$. The way we treat this is by bounding $\Delta^n$ from above by a quantity that depends on $\varepsilon_1$,
rather than by an arbitrarily small $\varepsilon''$. Recall that $\varepsilon_1$ refers to the bound applied for the C-tightness of $X^{0,n}$ in (59)
and is used to set the size of each interval in the coverings $\Xi_k$ and $\tilde\Xi_k$.

Define $c_1 = 4/\theta_{\min}$ and $\theta_{\min} = \min_i (\theta_{i+1} - \theta_i)$, where for convenience we denote $\theta_0 = 0$. We have for any
$w \in \tilde\Xi_k$, $|w - w_j| \le 4\varepsilon_1$, since $w_j$ is also in $\tilde\Xi_k$. Now, if $w < w_j$, then for coordinate $j$ of the minimizing curve it
holds that $\gamma_j^a(w) = \chi_h^{(2)}$. In this case,

$$w = (a - \chi_l^{(2)})\theta_h + \chi_l^{(2)} \theta_l = w_j - \chi_l^{(2)} (\theta_h - \theta_l).$$

We have $\chi_l^{(2)} \le 4\varepsilon_1/(\theta_j - \theta_{j-1})$, and consequently $\gamma_{j-1}^a(w) \le 4\varepsilon_1/(\theta_j - \theta_{j-1})$. Thus, $\gamma_j^a(w) = a - \chi_l^{(2)} \ge a - c_1\varepsilon_1$.
This shows that on $\Omega^{n,k}$,

$$\gamma_j^a(X^{\#,n}(t)) \ge a - c_1\varepsilon_1, \quad t \in T^n. \tag{73}$$

Otherwise, if $w > w_j$, it follows that $\gamma_j^a(w) = \chi_l^{(1)}$ and

$$w = \chi_h^{(1)} \theta_h + (a - \chi_h^{(1)})\theta_l = w_j + \chi_h^{(1)} (\theta_h - \theta_l).$$

We have $\chi_h^{(1)} \le 4\varepsilon_1/(\theta_{j+1} - \theta_j)$ and consequently $\gamma_{j+1}^a(w) \le 4\varepsilon_1/(\theta_{j+1} - \theta_j)$. Thus, $\gamma_j^a(w) \ge a - c_1\varepsilon_1$. This shows
that in this case, on $\Omega^{n,k}$,

$$\gamma_j^a(X^{\#,n}(t)) \ge a - c_1\varepsilon_1, \quad t \in T^n. \tag{74}$$

Consequently, by the position of $X^{\#,n}$ in the region $\tilde\Xi_k$ in case (II), we have by (73) and (74) that the
distance between the minimizing curve and $X^n$ is of order $\varepsilon_1$. Now, (62) is valid for all $i \notin \{j - 1, j, j + 1\}$, by the
proof given in case (I).

For the case $j^n = 2$, if $w > w_1$, the proof is identical. In the case $w < w_1$, by (45), $\gamma_1^a(w) = w/\theta_1$. Since $w_1 = a\theta_1$,
it follows that $a\theta_1 - w \le 4\varepsilon_1$ and $\gamma_1^a(w) \ge a - 4\varepsilon_1/\theta_1 \ge a - c_1\varepsilon_1$. This shows that (73) holds in this case as well. Finally,
for the case $j^n = 1$, if $w > w_j$ then the proof is similar to the cases $1 < j^n < J$ (the fact that $\gamma^a$ may assume the
value zero does not affect the proof). Note that (62) is valid for all $i > 2$, by the proof given in case (I). For the
particular case $j^n = J$ we have $w_J = a^*$, because this is the maximal buffer load. Thus, we have only the case where
$w < w_J$, which is treated in a similar way.

Combining all the estimates, for all small $\varepsilon''$,

$$P(\Omega^{n,k} \cap \{\max_{i \notin \{j-1, j, j+1\}} \Delta_i^n(\zeta^n) > \varepsilon''\}) \to 0.$$

The estimate (59) and the bounds (73), (74) give $P(\Omega^{n,k} \cap \{\max_{i \in \{j-1, j, j+1\}} |\Delta_i^n(\zeta^n)| > 2c_1\varepsilon_1\}) \to 0$ as $n \to \infty$. This gives

$$P(\Omega^{n,k} \cap \{\max_{i \le I} |\Delta_i^n(\zeta^n)| > 4c_1\varepsilon_1\}) \to 0,$$

as $n \to \infty$.

We now determine the constant $c_0$ used to define $K$. The objective is to determine the relation between $\varepsilon_1$ and $\varepsilon'$, the latter being
the threshold that defines the first time the difference between the scaled process and the minimizing curve exceeds $\varepsilon'$.
We do this in such a way that $4c_1\varepsilon_1 < \varepsilon'/2$. In particular, any constant $c_0 > 8c_1 x = 32x/\theta_{\min}$ will do.

We have bounded $\Delta_i^n(\zeta^n)$ by an arbitrarily small $\varepsilon''$ in case (I) and by $\varepsilon_1$ in case (II), and set the relation between
these two bounds. This way we obtain $P(\Omega^{n,k}) \to 0$ as $n \to \infty$.


(III)

$0 \in \tilde\Xi_k$. There is no difference between this case and case (I), except that $X^n$ may be at zero at some $t$. The analysis
in case (I) results in the same conclusion, namely $P(\Omega^{n,k}) \to 0$ as $n \to \infty$.


(IV)

$a^* \in \tilde\Xi_k$. This case corresponds to the rejections at the free boundary. It is treated by adding a negative term in the
equations written in case (I). (Note that we can pick $\varepsilon$ to be sufficiently small to avoid treating (II) here.)

Having shown that $P(\Omega^{n,k}) \to 0$ in all cases, using (61) and the fact that $\delta > 0$ is arbitrary completes the proof
of the lemma.

As a consequence of the lemma and (58), we have $P(\sigma^n < T) \to 0$ as $n \to \infty$. Since $\varepsilon'$ is arbitrary, (57) is
established.

To complete the proof one has to show the weak convergence of the costs, i.e.

$$\int_0^\infty e^{-\alpha t} [h \cdot X^n(t) + \alpha r \cdot Z^n(t)]\, dt \Rightarrow \int_0^\infty e^{-\alpha t} [h \cdot \gamma^a(X^a(t)) + \alpha \bar{r} Z^a(t)]\, dt,$$

and to show the uniform integrability of the rejection and holding cost components to obtain

$$\lim J^n(U^n) = \lim E\Big[\int_0^\infty e^{-\alpha t} [h \cdot X^n(t) + \alpha r \cdot Z^n(t)]\, dt\Big] = E\Big[\int_0^\infty e^{-\alpha t} [h \cdot \gamma^a(X^a(t)) + \alpha \bar{r} Z^a(t)]\, dt\Big] = V(x_0; \varepsilon).$$

Then, the final result follows once $V(x_0; \varepsilon) \to V(x_0)$ as $\varepsilon \to 0$ is seen. These technical steps are covered in [3] and
are omitted here. This completes the proof of the theorem.

7 Conclusion

In this paper, we analyzed the problem of service scheduling and admission of multi-class tasks arriving at a shared
buffer. The main motivation for this problem comes from the growing utilization of cloud computing resources.
We demonstrated that, in the limit, the queueing control is expressed via a one-dimensional reduced workload control
problem. Using the fact that the value functions of the reduced and multi-task problems coincide in the limit, we
formulated a policy and proved that, once applied to the multi-task scaled problem, it attains the upper
bound for the value function. Hence, we proved that the AO policy constitutes a form of state-space collapse. In
particular, we demonstrated that the shared buffer is either occupied by two classes of tasks, or it is not full. The
classes of the tasks are determined by the workload.

The notion of the workload process is of particular importance in computing systems. Namely, it indicates the remaining
computing effort until the buffer becomes empty. The AO policy we presented has the advantage of being simple to
implement, which is important in cloud computing, where the management overhead carries considerable weight. Moreover, as
the rate of incoming tasks and cloud computing capabilities increase, the policy becomes closer to the optimal
one and the optimality gap decreases. We conclude that the diffusion approximation method has practical significance of
constantly growing importance in this context. We suggest that other cloud computing configurations, and computer
networks in general, can be treated similarly.